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<210>SEQIDNO2 
<21T> LENGTH: 9 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic Peptide 
<400> SEQUENCE: 2 

Arg Tyr Phe Pro Asn Ala Pro Tyr Leu 

1 : 5 



02/04/2008 



> d his ful; d que sta 

(FILE 1 HOME 1 ENTERED AT 08:54:03 ON 04 FEB 2008) 

FILE ' REGISTRY 1 ENTERED AT 08:54:13 ON 04 FEB 2008 
LI 1 SEA ABB=ON PLUON RYFPNAPYL/SQS P 

L2 82 SEA ABB=ON PLU=ON RYFPNAPYL/ SQSFP 

L3 ANALYZE PLU=0N L2 1- LC : 5 TERMS 

D 

D L2 1-82 

D L2 SQIDE 

D L2 SQIDE 1- 

FILE ' ZCAPLUS' ENTERED AT 08:59:21 ON 04 FEB 2008 
L4 7 6 SEA ABB=ON PLU=0N L2 

D L4 1- IBIB KWIC 



FILE HOME 
FILE REGISTRY 

Property values tagged with IC are from the ZIC/VINITI data file 
provided by InfoChem. 

STRUCTURE FILE UPDATES: 3 FEB 2008 HIGHEST RN 1001389-12-3 
DICTIONARY FILE UPDATES: 3 FEB 2008 HIGHEST RN 1001389-12-3 

New CAS Information Use Policies, enter HELP USAGE-TERMS for details. 

TSCA INFORMATION NOW CURRENT THROUGH June 2 9, 2007 

Please note that search-term pricing does apply when 
conducting SmartSELECT searches. 

REGISTRY includes numerically searchable data for experimental and 
predicted properties as well as tags indicating availability of 
experimental property data in the original document. For information 
on property searching in REGISTRY, refer to: 

http: //www. cas . org/support /stngen/stndoc/properties . html 

FILE ZCAPLUS 

Copyright of the articles to which records in this database refer is 
held by the publishers listed in the PUBLISHER (PB) field (available 
for records published or updated in Chemical Abstracts after December 
26, 1996), unless otherwise indicated in the original publications. 
The CA Lexicon is the copyrighted intellectual property of the 
American Chemical Society and is provided to assist you in searching 
databases on STN. Any dissemination, distribution, copying, or storing 
of this information, without the prior written consent of CAS is 
strictly prohibited. 

FILE COVERS 1907 - 4 Feb 2008 VOL 148 ISS 6 
FILE LAST UPDATED: 3 Feb 2008 ( 20080203/ED) 

New CAS Information Use Policies, enter HELP USAGETERMS for details. 
This file contains CAS Registry Numbers for easy and accurate 
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substance identification . 



L2 82 SEA FILE=REGISTRY ABB=0N PLU=ON RYFPNAPYL/SQSFP 

L4 7 6 SEA FILE=ZCAPLUS ABB=0N PLU=ON L2 



Page 2 



02/04/2008 

=> d 12 sqide 

L2 'ANSWER 1 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 956835-95-3 REGISTRY 

CN Protein (human gene KIAA0226 fragment) (CA INDEX NAME) 
OTHER NAMES: 

CN 13: PN: WO2007128984 SEQID: 13 claimed protein 
FS PROTEIN SEQUENCE 
SQL 960 

PATENT ANNOTATIONS (PNTE) : 
Sequence | Patent 
Source (Reference 

Not Given|WO2007128984 
I claimed SEQID 
113 



All 

%T 9t APPliDiutS 



SEQ 1 LLGNLKTTVE GLVSTNSPNV WSKYGGLERL CRDMQSILYH GLIRDQACRR 

51 QTDYWQFVKD IRWLSPHSAL HVEKFISVHE NDQSSADGAS ERAVAELWLQ 
101 HSLQYHCLSA QLRPLLGDRQ YIRKFYTDAA FLLSDAHVTA MLQCLEAVEQ 



151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 

HITS AT: 

MF Unspec 

CI MAN 

SR CA 

LC STN Fi 

DT.CA CApl 

RL.P Role 
PRP 



NNPRLLAQID 
GSFSSLHQSV 
VSVSALARDS 
GNLDPRGRTA 
SQLSSVLRRS 
IIVEDPIAES 
GMFRRPSEGQ 
MMSQCLEEEE 
FRVTSSSSQF 
SQSFSHCFLH 
PIPDSLPISP 
AVAKQNYRCA 
LRKWDFSKYY 
RVQLCHMKNM 
LGPRLAELTR 
EECKACYHKA 
ALEAAVLEAT 
124-132 
if ied 



ASMFARKHES 
PNNGSERRST 
PLTPNEMSSS 
SCQSHSSNAE 
SFSEGQTLTV 
CNDKAKLRGP 
SLISYLSEQD 
VEEEDSDREI 
SSRDSAQLSD 
STSAEAVAMG 
DDGQHADIYK 
GCGIRTDPDY 
VSNFSKDLLI 
FKTCRLAKEL 
AGATHVERCM 
CFKSGSCPRC 



PLLVTKSQSL 
SFPLSGPPRK 
TLTSPIEASW 
SSSSNLFSSS 
TSGAKKSHIR 
LPYSGQSSEV 
FGSCADLEKE 
QELKQKIRLR 
SGSADEVDEF 
LLKQFEGMQL 
LRIRVRGNLE 
IKRLRYCEYL 
KIWNDPLFNV 
LDSFDTVPGH 
LCQAKGFICE 
ERLQARREAL 



TALPSSTYTP 
PQESRGHVSP 
VSSQNDSPGD 
SSQKPDSAAS 
SHSDTSIASR 
STPSSLYMEY 
NAHFSISESL 
RQQIRTKNLL 
EIQDADIRRN 
PAASELEWLV 
WAPPRPQIIF 
GKYFCQCCHE 
QDINSALYRK 
LTEDLHLYSL 
FCQNEDDIIF 
ARQSLESYLS 



PNSYAQHSYF 
AEDQTIQAPP 
ASEGPEYLAI 
SLGDQEGGGE 
GAPGGPRNIT 
EGGRYLCSGE 
IAAIELMKCN 
PMYQEAEHGS 
TASSSKSFVS 
PEHDAPQKLL 
NVHPAPTRKI 
NAQMAIPSRV 
VKLLNQVRLL 
NDLTATRKGE 
PFELHKCRTC 
DYEEEPAEAL 



les: CA, CAPLUS, TOXCENTER 

us document type: Patent 

s from patents: ANST (Analytical 

(Properties); USES (Uses) 

1 REFERENCES IN FILE CA (1907 
1 REFERENCES IN FILE CAPLUS ( 



study) ; BIOL (Biological study) ; 

TO DATE) 
1907 TO DATE) 



=> d 12 sqide 1- 

YOU HAVE REQUESTED DATA FROM 82 ANSWERS 



L2 



CONTINUE? Y/ (N) :y 
ANSWER 1 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 



RN 956835-95-3 REGISTRY 

CN Protein (human gene KIAA0226 fragment) (CA INDEX NAME) 
OTHER NAMES: 
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CN 13: PN: WO2007128984 SEQID: 13 claimed protein 
FS PROTEIN SEQUENCE 
SQL 960 



PATENT ANNOTATIONS (PNTE) : 
Sequence | Patent 
Source [Reference 



Not Given|WO2007128984 
| claimed SEQID 
113 



SEQ 1 LLGNLKTTVE GLVSTNSPNV WSKYGGLERL CRDMQSILYH GLIRDQACRR 

51 QTDYWQFVKD IRWLSPHSAL HVEKFISVHE NDQSSADGAS ERAVAELWLQ 
101 HSLQYHCLSA QLRPLLGDRQ YIRKFYTDAA FLLSDAHVTA MLQCLEAVEQ 



151 NNPRLLAQID ASMFARKHES PLLVTKSQSL TALPSSTYTP PNSYAQHSYF 

201 GSFSSLHQSV PNNGSERRST SFPLSGPPRK PQESRGHVSP AEDQTIQAPP 

251 VSVSALARDS PLTPNEMSSS TLTSPIEASW VSSQNDSPGD ASEGPEYLAI 

301 GNLDPRGRTA SCQSHSSNAE SSSSNLFSSS SSQKPDSAAS SLGDQEGGGE 

351 SQLSSVLRRS SFSEGQTLTV TSGAKKSHIR SHSDTSIASR GAPGGPRNIT 

401 IIVEDPIAES CNDKAKLRGP LPYSGQSSEV STPSSLYMEY EGGRYLCSGE 

4-51 GMFRRPSEGQ SLISYLSEQD FGSCADLEKE NAHFSISESL IAAIELMKCN 

501 MMSQCLEEEE VEEEDSDREI QELKQKIRLR RQQIRTKNLL PMYQEAEHGS 

551 FRVTSSSSQF SSRDSAQLSD SGSADEVDEF EIQDADIRRN TASSSKSFVS 

601 SQSFSHCFLH STSAEAVAMG LLKQFEGMQL PAASELEWLV PEHDAPQKLL 

651 PIPDSLPISP DDGQHADIYK LRIRVRGNLE WAPPRPQIIF NVHPAPTRKI 

701 AVAKQNYRCA GCGIRTDPDY IKRLRYCEYL GKYFCQCCHE NAQMAIPSRV 

751 LRKWDFSKYY VSNFSKDLLI KIWNDPLFNV QDINSALYRK VKLLNQVRLL 

•801 RVQLCHMKNM FKTCRLAKEL LDSFDTVPGH LTEDLHLYSL NDLTATRKGE 

851 LGPRLAELTR AGATHVERCM LCQAKGFICE FCQNEDDIIF PFELHKCRTC 

901 EECKACYHKA CFKSGSCPRC ERLQARREAL ARQSLESYLS DYEEEPAEAL 
951 ALEAAVLEAT 

HITS AT: 124-132 

MF Unspecified 

CI MAN 

SR CA 

LC STN Files: CA, CAPLUS, TOXCENTER 
DT.CA CAplus document type: Patent 

RL.P Roles from patents: ANST (Analytical study); BIOL (Biological study); 
PRP (Properties); USES (Uses) 

1 REFERENCES IN FILE CA (1907 TO DATE) 

1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 2 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 937580-08-0 REGISTRY 

CN Signal transduction histidine kinase with Chase domain .( Bradyrhi zobium 

strain BTAil) (CA INDEX NAME) 
OTHER NAMES: 
CN GenBank ABQ38889 

CN GenBank ABQ38889 (Translated from: GenBank CP000494) 
FS PROTEIN SEQUENCE 
SQL 552 

SEQ 1 MLRLGVI IGV IALIGTALSG LAAYRVHDQE LAIDRIALAR AIDVHASLVQ 

51 DRLTERELLA RVASGLFRAP SVIKANMLEP LRSSI YAFKT DFVVASWIAR 
101 LRPDELPIAQ AELRQAGFPN PTIRNYDQKP LDAATLTGPI DVLMDLEPRN 
151 ADTLKLPGVA LDRQPIVGPM LTRAMAEGKP VASDPTPLLR ANGPIGIVLA 
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201 APVVPQGATA 

251 NDQGAVSGRT 

301 IVGVIGLALT 

351 RVKNILAVIQ 

4 01 GVQLRGLFES 

4 51 LSLVGKHPHI 

501 DRVAPEALGG 



PAGFVTFS YE 
RGADAPAPSS 
GIICGLFGYV 
SIVTRTLRHG 
RAIPHADRIA 
TAKWDVTGEE 
TSKRFFTEAS 



IGPLLLTNDD 
TRTVTFGNHD 
AYNNLRLSRE 
SDIDVARELL 
VTGPDITVSA 
PNAVFQFRWE 
YVYELTAPME 



LSLFSVALKD 
WSLVYYAKSN 
IQVRIGFERR 
IGRIHAMSNV 
RAAQSLSLLF 
EFNTSEATRR 
TVVDITERDR 



PRNASDELVA 
SVRRAQQTAA 
LTAVIDELNH 
VTLLSESQWQ 
FELASHSDEG 
PDSDFGLILL 
TEQFSAPVHP 



551 SR 
HITS AT: 514-522 
MF Unspecified 
CI MAN 
SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
' 1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 



L2 ANSWER "3 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 937456-80-9 REGISTRY 

CN Signal transduction histidine kinase with Chase domain (Bradyrhizobium 

strain ORS278) , (CA INDEX NAME) 
OTHER NAMES: 
CN GenBank CAL7 4 960 
CN GenBank CAL7 4 960 
FS PROTEIN SEQUENCE 
SQL 551 



(Translated from: GenBank CU2341I8) 



SEQ 1 MLRLGVI IGV 

51 DRLTERELLA 

101 LRPDELPVAQ 

151 ETSRLPGRAL 

201 PVVPQGATAP 

251 DQGAVTTRTR 

301 VGVIGLALTG 

351 VKNILAVIQS 

4 01 VQLRGLFEAR 

451 SLVGKHPHIT 

501 RVAPEALGGA 



IALLGTALSG 
RVASGLFRAP 
AELHQAGFSN 
DRQPIVGPML 
AGFVTFS YEI 
GPDDPAPSST 
IIGGLFGYVA 
IVTRTLRHGS 
AIPHAERIAV 
AKWEVTGDEP 
SKRFFTEGSY 



LAAYRVHDQE 
SVIKANMLEP 
PTIRSYDDTP 
ARAMAEGKPV 
GPLLLTNDDL 
RTVAFGNHDW 
YNNLRLSREI 
DIDVARELLI 
SGPDITVSAR 
NAVFNFRWEE 
VYELTAPMET 



LALDRIALAR 
LRSSI YAFKT 
LGPNVDHPID 
ASDPTPLLRP 
SLFSVALKDP 
SLVYYAKSNA 
QVRIGFERRL 
GRIHAMSNVV 
AAQSLSLLFF 
RNTSEATRRP 
VVDMTERDRT 



AIDVHASLVQ 
DFVVASWIAR 
VLMDLEPRNA 
DGPAGIVLAA 
RN AS DEL VAN 
ARRAQQTAAI 
TAVIDELNHR 
TLLSESQWQG 
ELASHSDEGL 
DSDFGLILLD 
EQFSAPVRPP 



551 K 

HITS AT: 513-521 - 
MF Unspecified 
CI MAN 
SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); 

1 REFERENCES IN FILE CA (1907 TO DATE) 

1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 



PRP (Properties) 



L2 ANSWER 4 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 934937-94-7 REGISTRY 

CN Protein (Corynebacterium glutamicum strain R 300-amino acid) 

NAME) 
OTHER NAMES: 
CN GenBank BAF53882 

CN GenBank BAF53882 (Translated from: GenBank AP009044) 



(CA INDEX 
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FS PROTEIN SEQUENCE , ' 
SQL 300 

SEQ 1 MAFGYVLREA VRGMGRNVTM TIALIITTSI SLALLATGFL VTNMTDRTKD 

51 IYLDRVEVMI QLDEDTSAND PECTAESCTE VRDVLEGLDG IDSITYRSRE 

101 ASYERFVEVF KDTDPVLVAE TSPDALPAAF HVRLEDPLAV EILDPVRDLS 

151 QVSNVIDQVD DLRGATENLD SIRNATFLIA AVQVLAS I FL IANMVQIAAF 

201 NRREETEIMR IVGASRFYTQ GPFVFEAILS TLIGAVFAVG ALFLGKELVI 



251 DKALRGLYDS QLIAPVTTTD IWLVAPIISG IGVVIAGIIA QLTLRFYVRK 
HITS AT: 216-224 
MF Unspecified 
CI MAN 
SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 5 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 931173-35-2 REGISTRY 

CN Protein (Geobacillus thermodenitrif icans strain NG80-2 926-amino acid) 

(CA INDEX NAME) 
OTHER NAMES: 
CN GenBank ABO65802 

CN GenBank ABO65802 (Translated from: GenBank CP000557) 
FS PROTEIN SEQUENCE 
SQL 92 6 



SEQ 1 MNGKWKIAKL IGLIIFIVSL PVLFFAFIGQ NPMKQTNKAT NH I AWN EDM 

51 GAKYDQKTYF FGKEIVPALA DRSSYHWSVV NRSQAENGLK NNEYDAVVYL 

101 PSDFSRNILS FNDKQPLKAT VQYKIQPNLD AKHRERVQRE LEASKNMINQ 

151 KMSTLYWSYV AQEIAAIRKK FNDILEKEIA FQKAMYAFYT PSSAKLAEEI 

201 KRQKDMLQEL LEASNNAKDS SAGTLQGIEE AEAEITAFVD SLSQYREYQQ 

251 RQNELLQEGI HEYKKRFNER MDLMLGKPWE VKPDVRHQEQ NLLESITTLR 

301 NTVAESHTSL MTFNEQLKQS TVQEQFEQLL TWKKEFVRQY QIEVNNQTLD 

351 RLQQSLIEFR QKLQSPSPGQ TDEEQPAPIE APPQPADENL LDALQQQLAN 

401 LKAQWQAFKP QSPQESWEHI EGSISQLETE MEKTKQAWQQ QLALQQQWQQ 

4 51 KYAQLVEQLN KQLTEGKADS IDDIVQQIKE KEQAVLASPA LPESRKQVLS 

501 SHFEAVIQNR NTADLLDYFA WLSIFDEAVQ QTTRFDEELV DQLLMNWNQR 

551 DAI FQMLSDV RGYFEELEQH SASSLKEVEA AEESTESFIE TTLGYIQEYD 

601 ENVQKMQETI TGQLQELSDA VSEVTMQLQE AVNEGQQTEE IWRGNDGEFV 

651 ITIQQNTMQD VQQISDLIAS IAEDQDHIVD YTNELHEKID SVQTKADELN 

701 DKWAANVNTT KQIRNDVYRL LNNTMIDQQA NGYVYDYLTN PVQVRGDLPE 

751 EQTTYTPPVV VLIIVLLCGM LIGFFLHYYS NSPFMLQLAL LLLMNVVVGL 



801 IINIYSLKIY PMQDVRAIKW SVLTIVLLFF CSSMVRLAFL IGPFTGWILG 
851 VGLVLFFITP LLDLVLPNFH FEHPIAETYI SIQYGDQQIF .YSTVVMMGVL 
901 SLLMAAVPFL KHRLADQQEE GEMYEG 

HITS AT: 777-785 

MF Unspecified 

CI MAN 

SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 
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L2 ANSWER 6 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 927516-46-9 REGISTRY 

CN Protein (Yamadazyma stipite strain CBS 6054 431-amino acid) 

NAME) 
OTHER NAMES: 
CN GenBank ABN64 712 
CN GenBank ABN64712 
FS PROTEIN SEQUENCE 
SQL 431 



(Translated from: GenBank CP000496) 



(CA INDEX 



SEQ 1 MGCAASVPEN EEKDPFLHDK RINDAIEQNL QLNKQNEKNQ VKLLLLGAGE 

51 SGKSTVLKQM KLLHKGGFTQ QERIQYSQVI WCDVVQSMKI LIVQARKLGI 

101 ALDCDQENSP LIPYKQIVLR ANALEQIDTG AAGGAHFMND YVLKYSEVQK 

151 NKRRMNSTGV ANISYWDDPP VKSDHESDNV NVVYNSTNPT PNSTYSREQI 

201 ADAIHKLWTS DPGIHNCYER SNEFQFEASA EYYFENVYKF ADPDYYCTDT 

■ 251 DILKGRIKTT GITETNFNIN SFKFKVLDAG GQRSERRKWI HCFDNITAVL 

301 FVLAISEYDQ MLFEDERVNR LHESIVLFDS LCNSKWFANT PFILFLNKTD 



351 I FEKKIQRSP LKQYYPEYNG KPQDSAEAMK FFETNFLKLN RTNKPI YVHR 

4 01 TCATDSKSMK FVLSAVTDVI VQQNLKKSGI M 
HITS AT: 335-343 
MF Unspecified 
CI MAN 
SR GenBank . 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 . ANSWER 7 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 925385-98-4 REGISTRY 

CN ElaA protein (Methyiibium pet roleiphilum strain PM1) (CA INDEX NAME) 

OTHER NAMES: 

CN GenBank ABM96751 

CN GenBank ABM96751 (Translated from: GenBank CP000555) 
FS PROTEIN SEQUENCE 
SQL 155 

SEQ 1 MDHPLPEISW RCARLHELSP LELQRIHIAR QQVFAVEQDC VFQDADEVDE 

51 HSAHLAAWRA DGVLLAYARL VDPGVKYAEP SLGRVLTTAV ARGTGVGRAL 
101 VRRAVDHLTG AFPGQGLRIS AQLRLERFYA EAGFLSIGEP YLEDDMPHIE 



151 MLRRG 
HITS AT: 127-135 

MF Unspecified * 

CI MAN 

SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO- DATE) 

L2 ANSWER 8 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 921701-11-3 REGISTRY 

CN Protein (Oryza sativa japonica strain Nipponbare gene Os08g0104 300 ) (CA 
INDEX NAME) 
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OTHER NAMES : 
CN GenBank BAF22691 
CN GenBank BAF22691 
FS PROTEIN SEQUENCE 
SQL 495 

SEQ 1 MNCCSSEAVL 

51 ICLMAQQGAC 
101 ACSPLRKKAA 
151 GVLVSMAGSS 
201 SIMAISFTSL 
251 AMPS LI FT KV 
301 LTSWKTFCPV 
351 VAVSPPRPIR 



(Translated from: GenBank AP008214) 



SSKVALLNSL 
NVVLIANNTT 
NGPVSPFALV 
SGIYIYAVFL 
LAMAAVLATC 
QEDNSTSSSC 
CKRDASAGTS 
RHPSSQSTSR 



FCTIVLPKEN 
LSFDDVEATF 
IRGGCQFDDK 
SKASGEVLKK 
FFVRRHQIRR 
AICLEDYSFG 
KPPASESTPL 
AYSISSAPRN 



CSNKMNCTKG 
TPEVKDSGVN 
VRNAQNAGFK 
YSGQSDVEVW 
DRGRIPVTRE 
EKLRVLPCRH 
LSSVIHLSAE 
YNLQRYYTNS 



GGFPLLFCAV 
GAIYAVEPLD 
AVIVYDDEDS 
ILPVYENSAW 
FHGMSSQLVK 
KFHATCVDMW 
STALSSFRST 
PYISTSRSNV 



401 DLANMSSQWS HTPHQASMHS LRSGHLSLPI NIRYTIPHVS RSDYGSASLG 
451 LSHDSCSHHG SPSYYHSSLG QQRS YLMHRT ESGPSLSTMV LQSPQ 
HITS AT: 385-393 

* *RELATED SEQUENCES AVAILABLE WITH SEQLINK** 

MF Unspecified 

CI MAN 

SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 9 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 919550-33-7 REGISTRY 

CN Proline iminopept idase (Roseobacter denitrif icans strain OCh 114 gene pip) 

(CA INDEX NAME) 
OTHER NAMES: 
CN GenBank ABG30110 

CN GenBank ABG30110 (Translated from: GenBank CP000362) 
FS PROTEIN SEQUENCE 
SQL 313 

SEQ 1 MQYLYPPVDP FDQRMLDVGQ GHRIYVEQCG NPAGIPVIVL HGGPGGGCSP 

51 AMRRYFDPTV FRVILFDQRG CGRSRPHASV THNTTWHLVD DIELIRRTLD 
101 IDDWIVFGGS WGATLSLIYA QSHPDRTRHL VLRGVFLMTQ AELDWFYGGG 
151 AGKFWPEVWA RFTGPIPEDE RGDLIEAYRR RLFSGDMPQE TRFAKAWSSW 
201 ENALASIHSS GTSGDAPGEY ARAFARLENH YFSNAGFLDF DGQILANVGR 

251 IAHIPGVIVQ GRYDMICPPD SAYRLAEAWE NCELKMVRNA GHALSEPGIS 

301 AELVRTMDRI GGR 
HITS AT: 230-238 
MF Unspecified 
CI MAN 
SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 



L2 ANSWER 10 OF 82 REGISTRY 
RN 914168-75-5 REGISTRY 



COPYRIGHT 2008 ACS on STN 
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CN Protein (Methanosarcina barkeri strain Fusaro 112-amino acid) (CA INDEX 
NAME) 

OTHER CA INDEX NAMES: 

CN GenBank AAZ70949 (9CI) 

OTHER NAMES: 

CN GenBank AAZ70949 (Translated from: GenBank CP000099) 
FS . PROTEIN SEQUENCE 
SQL 112 



SEQ 



1 MSKIFNYMCF LISQTSNKKF LGIIRNIFQK SEIKKLNMAK EGIENLEHHK 
51 TVFNCLLNTR RVNIRQRLSG RQFRTSRKCL YHNQFLLRVR FYSETPFLFR 



101 TRFILKRLFI HV 
HITS AT: 90-98 
MF Unspecified 
CI MAN ' 
SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 



L2 ANSWER 11 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 912220-07-6 REGISTRY 

CN Esterase/lipase (Lactobacillus casei strain ATCC 334) (9CI) 

NAME) 
OTHER NAMES: 
CN GenBank ABJ70609 

CN GenBank ABJ70609 (Translated from: GenBank CP000423) 
FS PROTEIN SEQUENCE 
SQL 269 ■ ' 



(CA INDEX 



SEQ 



1 MQVEKDVI YD SAYNLAADLY VPDEANGGAI VYAHGGGWFR GDKENESDLG 
51 KYFADAGYLF TIPNFRLAPK YLYPTAQNDF DHFINWLLAS PYDFDRERLG 



101 LLGASSGGTM VLQNSLASGY PLVAWSPVVD FANWVQKNQM VKASVDGKKE 
151 LGLTEIHEIH DSFYKYFVQT YLGGLDPRLL TAVNPTNHLT DQLGPALLFN 
201 SADELMPLPS ALHFLQQAAM FGRDIGLHVV PGTGHARDYT SFALPETKRF 
251 FDHHLFTTVE DKTLNKATD 

HITS AT: 51-59 

MF Unspecified 

CI MAN 

SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 



L2 ANSWER 12 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 912087-07-1 REGISTRY 

CN Protein (Oenococcus oeni strain PSU-1 429-amino acid) 

NAME) 
OTHER NAMES: 
CN GenBank ABJ57333 

CN GenBank ABJ57333 (Translated from: GenBank CP000411) 
FS PROTEIN SEQUENCE 
SQL 429 



(9CI) (CA INDEX 
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SEQ 1 
51 
101 



MKRKGLFSLG 
VVFLLFLLTI 
VELTHPSIGF 



NRIIFYIFNF 
SLAIFVNEKL 
DAGAIHSALT 



FIFLTLYFAI 
RNVFRYIFIK 
DTKSRELQAY 



ASPNLTLGDT 
RALFSSIVIF 
YSLSTNNINL 



PAGNRTTWIS 
VLVILLQIIF 
MLQQHYLAKF 



(CA INDEX 



151 FSNSSWLLFD NINLFLVDLS AVFNALIVAV LDRRKISNVL YIQSIWLALF 

201 PMILVPYSDT WVLPFVSLYL LSYCIVFHSK FNGILRVFLA ISGGSVLSAS 
251 YFIKPSAIIP FIAIFLVEIL YLFKKDDRRK KKYAFI ILIS SLFFIVSTGI 
301 TYRYLQNANN SETYMKIDRS RTMPAIHFIS MGMAGDGGYN KEDNLAMAAR 
351 PSKKEKVEYS KRKINQRLAR MGIFGYIKFL FHKQNKNSSD GSFAWLIEGH 
4 01 FMSAKVASKG LKAFCSKFCI SEWQTFSGF 

HITS AT: 149-157 

MF Unspecified 

CI MAN 

SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1. REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 13 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 909480-03-1 REGISTRY 

CN Ferrochelatase (Bordetella avium strain 197N gene hemH) (9CI) 

NAME) 
OTHER NAMES: 
CN GenBank CAJ50331 

CN GenBank CAJ50331 (Translated from: GenBank AM167904) 
FS PROTEIN SEQUENCE 
SQL 364 



SEQ 1 VFLRLFKYLW PERFLPEPAT AEDRFDDNPP PRGPGRAGVL LINLGTPDAP 

51 TAADIRKYLA EFLSDQRVIE IPRYLWKPIL HGLVLTFRPK KLAPRYAGIW 

101 MEGGSPLMVY SRKQAEGVAA ALAERGLDVP VALGMRYGNP SVPDAIAQLR 

151 AQGCDHILTV PLYPQYAAST TATAVDAVTR HASRLRDQPA LRFVKRFYAD 



201 PAYIEAQAER IQSFWDAQGK PQKLLMSFHG LPRYSIELGD PYYRDCLDTA 

251 RLLRKRLGLS PEEVEVTFQS RFGSARWMEP YTEPTLKMLA AQGITHIDVV 
301 CPGFVADCLE TLEEINQECR HAFMEAGGQQ FRYIPALNDS PLWVSGLADL 
351 VETQLQAWPV KRAG 

HITS AT: 196-204 . 

MF Unspecified 

CI MAN 

SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 14 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 908963-98-4 REGISTRY 

CN Proline iminopeptidase (Rhizobium leguminosarum viciae strain 3841 gene 

pip) (9CI) (CA INDEX NAME) 
OTHER NAMES: 
CN GenBank CAK08026 

CN GenBank CAK08026 (Translated from: GenBank AM236080) 
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FS PROTEIN SEQUENCE 
SQL 324 

SEQ 1 MSALYPEIEP YDHGQLDTGD GNLI YWEACG NPAGRPALVL HGGPGSGCST 

51 AARRYFDPDA YRI ILFDQRN CGRSLPSAAD PETDLSLNTT WHLVADIERL' 
101 RAHLGIDSWL LFGNSWGSTL ALAYAETHPE RVAAIVLSGV TTTRRSEIDW 
151 LCRGMAPLFP EEWHRFRQVI PAGSQGRDED IVAAYHRLLN DPDPETRFKA 
201 ARDWHDWEAA SILLADPQGR PRRWADPAYM LTRARIITHY FSNGAWLEDG 



251 QLLKNAARLI GSPGILLQGR LDIEAPLVTA WELARAWPQS ELSILAHAAH 

301 STANPDMSAA IVTATDRFRY FPQK 
HITS AT: 239-247 
MF Unspecified 
CI MAN 
SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE)^ 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 15 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 889725-65-9 REGISTRY 

CN LOC432185 protein (Xenopus laevis clone IMAGE : 694 9153 gene LOC432185) 

(9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN GenBank AAH94122 ; 

CN GenBank AAH94122 (Translated from: GenBank BC094122) 
FS PROTEIN SEQUENCE 
SQL. 722 



SEQ 1 VKMAALSEHF TLCGLLTGTD DGKSEILGVE PAGEPDRVLV TDSVQAVTLY 

51 KVSDQKPQGA WAVKQGQSIT CPAVLNPESG EFIVVHDDKV LRIWKEDNVN 

101 LDIAFKATLS ADVCRIHTLP NTDPLVLFKG GAVHFLDSLL TDPQQKIGTV 

151 LSDGERIVWS EIFADDGQPL IVYLTQQFSN YFVYIHKFSP VCVCKYHLKP 

201 NTEDSTILDC SGSVKSKIFT LLTLYSSGQV CQTPFPVSLI NKETERVVSA 

251 SPLLQLSGPI ' EVGALNFLDE SHVAVLISSS SEQKECLSIW NTTFQTLQAA 

301 RNFQQRTSAQ LWCYDNKLFV PHGKTLVVVP YVCEASCLAS VLGKSRNIQT 

351 SVLENVPFVN WDKLVGKDPE TKPSNAGAQK KTRERKTNAN AGNGTESILY 

401 PFDVQNISQT QTEAFVQQLL LGKEDTDFQI TVGKITQGLV KRCMADPKFY 

451 PQSSFVQLVQ TNTLSYSLCP DLLSLFLEKR DVPLLQLCLH SFPDVPEVIL 



501 CLCLKAFLSI SEKLVNAAQI NTELASLYID VGDKDKEHKY TEHPEEPSVL 
551 QNGFSPTALE EDSCDELIAE SLPQTTQKAT CPISIKRAVL VNSILISPYN 
601 ESFLLPHLKD MSGDQVMFFL RYLLYLYLKF NENITINHPG KQMPTVSQIV 
651 DWMSMLLDAH FATVVMLSDA KALLNKIQKT VKSQLKFYSE MNKIEGCLAE 
701 LKELKCPARV SARYSIEVLQ LY 

HITS AT: 448-456 

MF Unspecified 

CI MAN 

SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patent s : BIOL (Biological study); PRP (Properties) 
2 REFERENCES IN FILE CA (1907 TO DATE) 
2 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 16 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
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RN 881346-58-3 REGISTRY 

CN Phage portal protein, lambda 

(9CI) (CA INDEX NAME)' 
OTHER NAMES: 



(Nitrobacter winogradskyi strain Nb-255) 



CN 
CN 
FS 
SQL 

SEQ 



GenBank ABA04073 
GenBank ABA04073 
PROTEIN SEQUENCE 
511 

1 MAANWIDRAL 
51 SSSADGEIAS 
101 AGTDALDNRI 
151 RIRRTEDKLP 
201 AYWLFPDHPG 
251 MRAMRDLDDW 
301 EQFEPGLIAY 
351 TGDLSQVNYS 



(Translated from: GenBank CP000115) 



ASVAPGAARK 
GASRLRDRMR 
NELWEAWAAR 
VPLQLQLLEA 
DTSVPLSRSL 
TNAELVRKKT 
ARGGKDIKFN 
SLRGGLVEFR 



RLLERQAFEK 
DLTRNNPHAA 
SDADGLADFH 
DHLDDTKIAA 
TSARVPADGI 
EACLVGVVLG 
QPASTAGVSE 
RMVDALQWQL 



LARAYDGAAV 
KAVAVL VNN I 
GLTTLAVREM 
LPDGGRIVRG 
AHLFERQRVQ 
ADEADQGVAP 
WLRAQLHI IA 
VIPGFCEPVW 



GRRTDGWRSS 
VGAGIRPRAA 
IEGGDVFLRR 
IEYDAIGRRR 
SRGVPWGAPA 
TVVDAEGKTI 
AGYRVPYELL 
RWFTEAAWVA 



4 01 GLIPNPVVKV EWQPPRFDAV DPLKDAQADL LMLRSGTMTL AQAIARQGYD 
4 51 PASQLGEIAE MNSALDRLKI VLDSDPRMMT NAGTAQPDPN DPAGDTADNE 
501 QPGKSKPKPG D 

HITS AT: 391-399 

MF Unspecified 

CI MAN 

SR GenBank 

LC STN Files: CA, CAPLUS 
DT . CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); 

1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 



PRP (Properties) 



L2 ANSWER 17 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 877723-81-4 REGISTRY 

CN Protein ( Desulf itobacterium hafniense strain Y51 216-amino acid) (9CI) 

(CA INDEX NAME) 
OTHER NAMES: 
CN GenBank BAE837 92 

CN GenBank BAE83792 (Translated from: GenBank AP008230) 
FS PROTEIN SEQUENCE 
SQL 216 

SEQ 1 MRLRRKAWAR PELESDPKVI YNPMHYKENW QEAFGNNHPV HLELGCGRGQ 

51 FINQCAELNP HINYIAI DLY DEVLVKALRK INEKALHNVR VIPMNIAKLE 
.101 SIFKHDQIEK IYINFCNPWP SRRHHHKRLT HPQFLSVYKK LMKDHSEIWF 
151 KTDDDELFKD SLKYFAEAGF IEKYRTFDLH QSEFTENIKT EYEEKFSNQG 



201 VKIKFGIFVV NKGRQN 
HITS AT: 163-171 
MF Unspecified 
CI MAN 
SR GenBank 

LC STN Files: CA, CAPLUS 

DT . CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 



L2 ANSWER 18 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 877707-56-7 REGISTRY 
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CN Protein ( Desulf itobacterium hafniense strain Y51 518-amino acid) 

(CA INDEX NAME) 
OTHER NAMES: 
CN GenBank BAE82167 

CN GenBank BAE82167 (Translated from: GenBank AP008230) 
FS PROTEIN SEQUENCE 
SQL 518 



(9CI) 



SEQ 1 MKLNLAVFAY HLADYAPRLY DSEIHGLGVA DVRFLDRQET FLSESVYIGM 

51 WDKLQALIKP PFYVVCIGRD ETTDRWLEEH GVKALILDSE VELNEVFERL 
101 QDIFQHYNRL ESELLEAVLC KEPLNVLLNI CAKFFANPTF ITDAALCLVA 



151 TCDNFAHPET DRAWQETIES GRSSSELLLL 
201 VGENYSKIIC ANYFDQEVRI ATFTVSEAYT 
251 VRKQHKASSR YLFAIRSNIS NLLHGQKIDE 
301 LKIPLTTAEL ADGTAGHNRK IYESIFPHYV 
351 DKRRFAELKK HLKNENAACG VSKTFHDFDM 
401 KEALYFYEDM MLEHLFSEGS RIFNLRSLCH 
4 51 LKVYLLQEKS LLAASQELHI HRNTLVYRLG 
501 LSCLILEYLN GLEEKAPD 

HITS AT: 133-141 

MF Unspecified 

CI MAN 

SR GenBank 

LC STN Files: CA, CAPLUS 
DT.CA CAplus document type: Journal 
RL.NP Roles from non-patents: BIOL (Biolog 
1 REFERENCES IN FILE CA (1907 
1 REFERENCES IN FILE CAPLUS ( 



MKRKKLTNLL 
PLSPLQAGLV 
AILKTNFAHI 
SLDLNEVLIL 
LGEEYKLANA 
DSVLRIAEHD 
KIEQISGLDL 



NTSRKAEFVN 
DHVAKLLTAE 
GWTDQEDYRL 
VIRCARDTEA 
ALEIGSWKYP 
QQNNSSLLQT 
NSPLIRLQGI 



ical study); PRP (Properties) 

TO DATE) 
1907 TO DATE) 



L2 ANSWER 19 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 

RN 868817-72-5 REGISTRY 

CN Phage tail fiber-like protein (cyanophage P-SSM2) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN GenBank 'AAX44 672 

CN GenBank AAX44672 

FS PROTEIN SEQUENCE 

SQL 1170 



(Translated from: GenBank AY939844) 



SEQ 1 MTRFRSGRVP HQHIGISSFT EDKLVLDVIG NATISGTLSV GVGTNNSGNG 

51 NISIGGTTGT IGQYLKSTGV GVTWADFADV RDSQSFVATA GQTTFVFTAN 
101 ELNQYNPNFI DVYVNGVKLI NSEYDAFNGS NVVLKSACFA GDIVELISYN 
151 TNQITNGMGG STIGINTFAT STFTHIGASG VVTATTYYGD GSNLTGVTTS 
201 LRSEYADVSG ISTISQGLTG TPDIAVRNIN VGIATFTGNL DAVDASFSGN 
251 VSIGGTLTYE DVTNIDSVGL ITARNGIDVT GPIKGFSYTQ SPYSNTVETI 
301 IVKVVTKTAA HRYNGNGSSL GYTFDGVESP FLTLTPGKTY RFDQSDNSNL 
351 THQLRFYLDP EKVVEYLRTP GEIVYSGAAG SNGSYTEIQI LDDTPTVLYY 
401 QCVNHSYMGN AVQVNTGHAI GFATAASIDT SGIITAFRYY GDGTYLSGVQ 

451 SQLVIQEEGG AVGTAGTINF VGVGVTASVT DDIATIQVGN HFAVTAGIAT 
501 VAEGITGSPN ASFSGITATG LQMTGLCTAG TFVGNLVGGV AGGNWGGYDS 
551 TATNNVSAGQ NVSAGQSVTA GNGFYGDGSG ITNLVTTSDT APSNPTDGNL 
601 WWKSDEGQLK I YYQDADSAQ WVDANASGGA GSSGGSSDLL NDTTPQLGGT 
651 LDLNNRDITG TGNISINGGL TLIGGTSNIG NVYSTGIVTA STFYGNIVGD 
701 LSGTSTGNVT GTVNATGLST FTTLDINGDV NVSGASTITG ALDVTDSVDA 
751 HSINVSTAVT AISFHGDGQY LTNIYSAPPQ GISTTGYTGL TNLFCGGYLE 
801 VDGQSILDDV IVSAAATFQG ALSANTGPVT LNNATINGTT NVTGPSTLTT 
851 VNVVGHSELD NVNISGVATA IGFVGPLTGN AATASNLSGS PSISVTNVSA 
901 GIITATNGYY GNVNVNNQNI EVGNCTTQGT DNTIKVGSSA LEIFHRPAAY 
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951 ATYLQNKADT NLFITGNDAT GTWGNIFIRP YLSNFSGVAC WWGGATELFY 
1001 GNNNTKRLET TSGGVNI IGT LSKSGGSFKI PHPVSGLSTT KHLVHSFLEG 
1051 PQMDLI YRGK VDLVGGTATV NIDTKAGMTE GTFVLLNRDV QCFTTNETGW 
1101 TAVKGSVTGN EITIVAQDNS CTDTISWMVV GERQDDTVKA LDMTDSEGNL 
1151 IVEPDQPAAD TKHADVQAQL 

HITS AT: 438-446 

MF Unspecified 

CI MAN 

SR GenBank 

LC ' STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN- FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 20 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 867889-10-9 REGISTRY 

CN Protein (Mus musculus strain NOD clone F630045F11 866-amino acid) (9CI) 

(CA INDEX NAME) 
OTHER NAMES: 
CN GenBank BAE41708 

CN GenBank BAE41708 (Translated from: GenBank AK170312) 
FS PROTEIN SEQUENCE 
SQL 866 

SEQ 1 MQNILYHGLI HDQVCCRQAD YWQFVKDIRW LSPHSALHVE KFISLHESDQ 

51 SDTDSVSERA VAELWLQHSL QCHCLSAQLR PLLGDRQYIR KFYTETAFLL 



101 SDAHVTAMLQ CLEAVEQNNP RLLAQIDASM FARKQESPLL VTKSQSLTAL 
151 PGSTYNPPAS YAQHSYFGSS SSLQSMPQSS HSSERRSTSF SLSGPSWQPQ 
201 EDRECLSPAE TQTTPAPLPS DSTLAQDSPL TAQEMSGSTL TSPLEASWVS 
251 SQNDSPSDVS EGPEYLAIGN PAPHGRTASC ESHSSSQKLE SAASSLGDQE 
301 EGRQSQAGSV LRRSSFSEGQ TAPVASGTKK SHIRSHSDTN IASRGAAEGG 
351 QYLCSGEGMF RRPSEGQSLI SYLSEQDFGS CADLEKENAH FSISESLIAA 
401 IELMKCNMMS QCLEEEEVEE EDSDREIQEL KQKIRLRRQQ IRTKNLLPAY 
451 RETENGSFRV TSSSSQFSSR DSTQLSESGS AEDADDLEIQ DADIRRSAVS 
501 NGKSSFSQNL SHCFLHSTSA EAVAMGLLKQ FEGMQLPAAS ELEWLVPEHD 
551 APQKLLPIPD SLPISPDDGQ HADI YKLRIR VRGNLEWAPP RPQIIFNVHP 
601 APTRKIAVAK QNYRCAGCGI RTDPDYIKRL RYCEYLGKYF CQCCHENAQM 
651 VVPSRILRKW DFSKYYVSNF SKDLLLKIWN DPLFNVQDIN SALYRKVKLL 
7 01 NQVRLLRVQL YHMKNMFKTC RLAKELLDSF DVVPGHLTED LHLYSLSDLT 
751 ATKKGELGPR LAELTRAGAA HVERCMLCQA KGFICEFCQN EEDVIFPFEL 
801 HKCRTCEECK TCYHKTCFKS GRCPRCERLQ ARRELLAKQS LESYLSDYEE 
851 EPTEALALEA TVLETT 

HITS AT: 91-99 

MF Unspecified 

CI MAN 

SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 21 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 867887-58-9 REGISTRY 

CN Protein (Mus musculus strain' NOD clone F630032M11 927-amino acid) (9CI) 

(CA INDEX NAME) 
OTHER NAMES: 
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CN GenBank BAE41645 

CN GenBank BAE41645 (Translated from: GenBank AK170223) 
FS PROTEIN SEQUENCE 
SQL 927 

SEQ 1 MRPESAGMDL GGGDGERLLE KSRREHWQLL GNLKTTVEGL VSANCPNVWS 

51 KYGGLERLCR DMQNILYHGL IHDQVCCRQA DYWQFVKDIR WLSPHSALHV 
101 EKFISLHESD QSDTDSVSER AVAELWLQHS LQCHCLSAQL RPLLGDRQYI 
151 RKFYTETAFL LSDAHVTAML QCLEAVEQNN PRLLAQIDAS MFARKQESPL 



201 LVTKSQSLTA LPGSTYNPPA SYAQHSYFGS SSSLQSMPQS SHSSERRPTS 
251 FSLSGPSWQP QEDRECLSPA ETQTTPAPLP SDSTLAQDSP LTAQEMSDST 
301 LTSPLEASWV SSQNDSPSDV SEGPEYLAIG NPAPHGRTAS CESHSSSQKL 
351 ESAASSLGDQ EEGRQSQAGS VLRRSSFSEG QTAPVASGTK KSHIRSHSDT 
4 01 NIASRGAAEG GQYLCSGEGM FRRPSEGQSL ISYLSEQDFG SCADLEKENA 
451 HFSISESLIA AIELMKCNMM SQCLEEEEVE EEDSDREIQE LKQKIRLRRQ 
501 QIRTKNLLPA YRETENGSFR VTSSSSQFSS RDSTQLSESG SAEDADDLEI 
551 QDADIRRSAV SNGKSSFSQN LSHCFLHSTS AEAVAMGLLK QFEGMQLPAA 
601 SELEWLVPEH DAPQKLLPIP DSLPISPDDG QHADIYKLRI RVRGNLEWAP 
651 PRPQIIFNVH PAPTRKIAVA KQNYRCAGCG IRTDPDYIKR LRYCEYLGKY 
701 FCQCCHENAQ MVVPSRILRK WDFSKYYVSN FSKDLLLKIW NDPLFNVQDI 
7 51 NSALYRKVKL LNQVRLLRVQ LYHMKNMFKT CRLAKELLDS FDVVPGHLTE 
801 DLHLYSLSDL TATKKGELGP RLAELTRAGA AHVERCMLCQ AKGFICEFCQ 
851 NEEDVIFPFE LHKCRTCEEC KACYHKTCFK SGRCPRCERL QARRELLAKQ 
901 SLESYLSDYE EEPTEALALE ATVLETT 

HITS AT: 152-160 

MF Unspecified 

CI MAN 

SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 22 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 867754-88-9 REGISTRY 

CN Protein (Mus musculus strain NOD clone F630007F11 927-amino acid) (9CI) 

(CA INDEX NAME) 
OTHER NAMES: 
CN GenBank BAE41507 

CN GenBank BAE41507 (Translated from: GenBank AK170002) 
FS PROTEIN SEQUENCE 
SQL 927 

SEQ 1 MRPESAGMDL GGGDGERLLE KSRREHWQLL GNLKTTVEGL VSANCPNVWS 

51 KYGGLERLCR DMQNILYHGL IHDQVCCRQA DYWQFVKDIR WLSPHSALHV 
101 EKFISLHESD QSDTDSVSER AVAELWLQHS LQCHCLSAQL RPLLGDRQYI 
151 RKFYTETAFL LSDAHVTAML QCLEAVEQNN PRLLAQIDAS MFARKQESPL' 



201 LVTKSQSLTA LPGSTYNPPA SYAQHSYFGS SSSLQSMPQS SHSSERRSTS 

251 FSLSGPSWQP QEDRECLSPA ETQTTPAPLP SDSTLAQDSP LTAQEMSDST 

301 LTSPLEASWV SSQNDSPSDV SEGPEYLAIG NPAPHGRTAS CESHSSSQKL 

351 ESAASSLGDQ EEGRQSQAGS VLRRSSFSEG QTAPVASGTK KSHIRSHSDT 

401 NIASRGAAEG GQYLCSGEGM FRRPSEGQSL ISYLSEQDFG SCADLEKENA 

451 HFSISESLIA AIELMKCNMM SQCLEEEEVE EEDSDREIQE LKQKIRLRRQ 

501 QIRTKNLLPA YRETENGSFR VTSSSSQFSS RDSTQLSESG SAEDADDLEI 

551 QDADIRRSAV SNGKSSFSQN LSHCFLHSTS AEAVAMGLLK QFEGMQLPAA 

601 SELEWLVPEH DAPQKLLPIP DSLPISPDDG QHADIYKLRI RVRGNLEWAP 
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651 
701 
751 
801 
851 
901 
HITS AT: 



MF 
CI 
SR 
LC 

DT.CA 
RL.NP 



Unspec 
MAN 

GenBan 
STN Fi 
CApl 
Role 



PRPQIIFNVH 

FCQCCHENAQ 

NSALYRKVKL 

DLHLYSLSDL 

NEEDVIFPFE 

SLESYLSDYE 

152-160 

if ied 



PAPTRKIAVA 
MVVPSRILRK 
LNQVRLLRVQ 
TATKKGELGP 
LHKCRTCEEC 
EEPTEALALE 



KQNYRCAGCG 
WDFSKYYVSN 
LYHMKNMFKT 
RLAELTRAGA 
KACYHKTCFK 
ATVLETT 



IRTDPDYIKR 
FSKDLLLKIW 
CRLAKELLDS 
AHVERCMLCQ 
SGRCPRCERL 



LRYCEYLGKY 
NDPLFNVQDI 
FDVVPGHLTE 
AKGFICEFCQ 
QARRELLAKQ 



k 

les: CA, CAPLUS 

us document type: Journal 

s from non-patents: BIOL (Biological study); PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 



L2 ANSWER 23 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 867454-64-6 REGISTRY 

CN Protein (Mus musculus strain C57BL/6J clone F730313B2'2 941-amino acid) 

(9CI) (CA INDEX NAME) 
OTHER NAMES: 



CN . 
CN 
FS 
SQL 

SEQ 



GenBank BAE42541 

GenBank BAE42541 (Translated from: 

PROTEIN SEQUENCE 

941 



GenBank AK171583) 



1 MRPEGAGMDL GGGDGERLLE KSRREHWQLL GNLKTTVEGL VSANCPNVWS 

51 KYGGLERLCR DMQNILYHGL . IHDQVCCRQA DYWQFVKDIR WLSPHSALHV 

101 EKFISLHESD QSDTDSVSER AVAELWLQHS LQCHCLSAQL RPLLGDRQYI 

151 RKFYTETAFL LSDAHVTAML QCLEAVEQNN PRLLAQIDAS MFARKQESPL 



201 LVTKSQSLTA LPGSTYTPPA SYAQHSYFGS 
251 FSLSGPSWQP QEDRECLSPA ETQTTPAPLP 
301 LTSPLEASWV SSQNDSPSDV SEGPEYLAIG 
351 SSSHLFSSSS SQKLESAASS LGDQEEGRQS 
401 SGTKKSHIRS HSDTNIASRG AAEGGQYLCS 
451 QDFGGCADLE KENAHFSISE SLIAAIELMK 
501 EIQELKQKIR LRRQQIRTKN LLPAYRETEN 
551 SESGSAEDAD DLEIQDADIR RSAVSNGKSS 
601 GLLKQFEGMQ LPAASELEWL VPEHDAPQKL 
651 KLRIRVRGNL EWAPPRPQII FNVHPAPTRK 
7 01 YIKRLRYCEY LGKYFCQCCH ENAQMVVPSR 
7 51 LKIWNDPLFN VQDINSALYR KVKLLNQVRL 
801 LLDSFDVVPG HLTEDLHLYS LSDLTATKKG 
851 MLCQAKGFIC EFCQNEEDVI FPFELHKCRT 
901 CERLQARREL LAKQSLESYL SDYEEEPTEA 

HITS AT: 152-160 

MF Unspecified 

CI MAN 

SR GenBank 

LC STN Files: CA, CAPLUS 
DT.CA CAplus document type: Journal 
RL.NP Roles from non-patents: BIOL (Biolog 
1 REFERENCES IN FILE CA (1907 
1 REFERENCES IN FILE CAPLUS ( 



SSSLQSMPQS 
SDSTLAQDSP 
NPAPHGRTAS 
QAGSVLRRSS 
GEGMFRRPSE 
CNMMSQCLEE 
GSFRVTSSSS 
FSQNLSHCFL 
LPIPDSLPIS 
IAVAKQNYRC 
ILRKWDFSKY 
LRVQLYHMKN 
ELGPRLAELT 
CEECKACYHK 
LALEATVLET 



SHSSERRSTS 
LTAQEMSDST 
CESHSSNGES 
FSEGQTAPVA 
GQSLISYLSE 
EEVEEEDSDR 
QFSSRDSTQL 
HSTSAEAVAM 
PDDGQQADIY 
AGCGIRTDPD 
YVSNFSKDLL 
MFKTCRLAKE 
RAGAAHVERC 
TCFKSGRCPR 
T 



ical study) ; PRP (Properties) 

TO DATE) 
1907 TO DATE) 



L2 ANSWER 24 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 863073-18-1 REGISTRY 

CN Protein ftsX (Corynebacterium glutamicum strain ATCC13032 clone RXA00009 
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gene ftsX) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 78: PN: US 20050191732 SEQID: 86 claimed protein 
FS PROTEIN SEQUENCE 
SQL 300 

PATENT ANNOTATIONS (PNTE) : 
Sequence | Patent 
Source | Reference 



Not Given|US2005191732 
| claimed SEQID 
186 



SEQ 1 MAFGYVLREA VRGMGRNVTM TIALIITTSI SLALLATGFL VTNMTDRTKD 

51 IYLDRVEVMI QLDEDTSAND PECTAESCTE VRDVLEGLDG IDSITYRSRE 

101 ASYERFVEVF KDTDPVLVAE TSPDALPAAF HVRLEDPLAV EILDPVRDLP 

151 QVSNVIDQVD DLRGATENLD SIRNATFLIA AVQVLASIFL IANMVQIAAF 

201 NRREETEIMR IVGASRFYTQ GPFVFEAILS TLIGAVFAVG ALFLGKELVI 



251 DKALRGLYDS QLIAPVTTTD IWLVAPIISG IGVVIAGIIA QLTLRFYVRK 
HITS AT: 216-224 

**RELATED SEQUENCES AVAILABLE WITH SEQLINK** 
MF Unspecified 
CI MAN 
SR CA 

LC STN Files: CA, CAPLUS, USPATFULL 
DT.CA CAplus document type: Patent 

RL.P Roles from patents: BIOL (Biological study); PRP (Properties); USES 
(Uses) 

1 REFERENCES IN FILE CA (1907 TO DATE) 

1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 25 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 853861-89-9 REGISTRY 

CN Cell division protein FtsX ( Corynebacterium jeikeium strain K411 gene 

ftsX) (9CI) (CA INDEX NAME) 
OTHER NAMES: 
CN GenBank CAI36614 

CN GenBank CAI36614 (Translated from: GenBank CR931997) 
FS PROTEIN SEQUENCE 
SQL 300 

SEQ 1 MKTNFVFREA FSGLRRNMTM TIAMIITTSI ALALLATGFL LSAMTERTKD 

51 IYFDRIEVMV QLDDKISSSD PTCASPECSE IKQQLEGDEA VKSVTFRNKE 
101 QSYERFVELF GESDPQLVEQ TEKDALPAAF HVRLADPENS APIDALRDNP 
151 TVTNIVDQGD NLEAAMRNLD SIRNASFIVA AVQAIAAIFL IMNMVQITAY 
201 SRRSEISIMR MVGASRWYTQ APFVLEAMMA ALIGAVLAVG GMFAAKILVV 



251 DRALKAVYDQ QLVARVTNSD LWLAAPFVVL VGVVVAAITA QVTLRWYVKN 
HITS AT: 216-224 
MF Unspecified 
CI MAN 
SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
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1 REFERENCES IN FILE CA (1907 TO DATE) 

1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 26 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 825584-79-0 REGISTRY 

CN Protein, similar to human and bovine NADH-ubiquinone oxidoreductase 39 kDa 

subunit (Ehrlichia ruminantium strain Welgevonden) (9CI) (CA INDEX NAME) 
OTHER NAMES: 
CN GenBank CAI26527 

CN GenBank CAI26527 (Translated from: GenBank CR925678) 
FS PROTEIN SEQUENCE 
SQL 320 

SEQ 1 MSIRRIIIFG GSGFIGKYLV RYFSNAGYII KVFTRCPEKA KQLRLCGLLG 



51 QIEIVSGDIN NNKELVEHIS GCYGVINLIG TLYNTKKTTF YNVHAHVAEN 

101 IAKIAKQLNV ELMVHFSAMG IDNICNSDYA KSKLIGERLV KESFPDAVIV 

151 RPNLVFGPED KFFNKFARLL MILPFLPVVG GGKFVFQPVY VDDVAKLVFH 

201 I IDYKIKDKL YNVCGPSTYS FKELLNLILS ITHRKSKLFN ISFCLASILA 

251 FVFEIKIISI FSKLITGSTD PILTRDQVKF MMGMTQLHDM YPIDDLKEMG 
301 INFATVEDIV PQYLEIYKKS 
HITS AT: 21-29 



**RELATED SEQUENCES AVAILABLE WITH SEQLINK** 

MF Unspecified 

CI MAN 

SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Rol-es from non-patents: BIOL (Biological study); PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 27 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 825575-27-7 REGISTRY 

CN Protein, similar to human and bovine NADH-ubiquinone oxidoreductase 39 kDa 

subunit (Ehrlichia ruminantium strain Gardel) (9CI) (CA INDEX NAME) 
OTHER NAMES: 
CN GenBank CAI27484 

CN GenBank CAI27484 (Translated from: GenBank CR925677) 
FS PROTEIN SEQUENCE 
SQL 320 

SEQ 1 MSIRRIIIFG GSGFIGKYLV RYFSNAGYII KVFTKCPEKA KQLRLCGLLG 



51 QIEIVSGDIN NNKELVEHIS GCYGVINLIG TLYNTKKTTF YNVHAHIAEN 
101 IAKIAKQLNV ELMVHFSAMG IDNICNSDYA KSKLIGERLV KESFPDAVIV 
151 RPNLVFGPED KFFNKFARLL MILPFLPVVG GGKFVFQPVY VDDVAKLVFH 
201 IIDYKIKDKL YNVCGPSTYS FKELLNLILS ITHRKSKLFN IPFCLASILA 
251 FVFEIKIISI FSKLITGSTD PILTRDQVKF MMGMTQLHDM YPIDDLKEMG ' 
301 INFATVEDIV PQYLEI YKNS 

HITS AT: 21-29 

MF Unspecified 

CI MAN 

SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) ' 
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1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 28 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 811265-70-0 REGISTRY 

CN Amino acid transporter (Cryptococcus neoformans neoformans strain JEC21) 

(9CI) (CA INDEX NAME) 
OTHER NAMES: 
CN GenBank AAW43077 

CN GenBank AAW43077 .(Translated from: GenBank AE017344) 
FS PROTEIN SEQUENCE 
SQL 556 

SEQ 1 MGSDSPKAYE MEDYDPKVAG SVDQTEIHIN ETGEGDIMRA RVLDSVNHRK 

51 LNARQIQLSS IAGAIGAALF VAIGSGVTAG PVALLIGFIF WATVVYSIAQ 
101 CQLEIVSLFP LDGSFIRLAG RMVDPALGTM VGYNHFFAQT SFVIFEATVV 



151 NTLVSYWGYS ESPAILISVS LLLYLAINVY RADLFGEAEF WLALGKVLLA 
201 IGLILYTLIT MVGGNPLKDR FGFRYWKDPG PWAGDSPSTR LESFINAVNT 
251 AGFCIGGPEY ISMIAGEATD PRKTVPRAFK TIMARLVVFF IGGALCVGIL 
301 VPYNDPTLVA GDGTYAGGSP YVISMNRLKI PVLPSIVTAA LLTCIVSGGN 
351 AYTFNASRSL HALALDGKAP AVLRRLNKKG VPYLAVIVVM LFSCLAYLAL 
4 01 GSTSAKVLNW ILNFCTAATM LNWCVMAFTY VRFYSAMKVQ NIDRKEFLPV 
451 YSKFQPFAGY WALCWACLFI WLQGYSVFLK GNWNVATFIF NYGIIALAGA 
501 IGLFFKIYER TPFHKSKDVD LHSDLDFFDA LNQYYQQKKD DSPPANVKDK 
■ 551 IMAKLF 

HITS AT: 135-143 

MF Unspecified 

CI MAN 

SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 2 9 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 810014-10-9 REGISTRY 

CN NADH-ubiquinone oxidoreductase subunit (Ehrlichia ruminantium strain 

Welgevonden) (9CI) (CA INDEX NAME) 
OTHER NAMES: 
CN GenBank CAH57752 

CN GenBank CAH57752 (Translated from: GenBank CR767821) 
FS PROTEIN SEQUENCE 
SQL 320 

SEQ 1 MSIRRIIIFG GSGFIGKYLV RYFSNAGYII KVFTRCPEKA KQLRLCGLLG 



51 QIEIVSGDIN NNKELVEHIS GCYGVINLIG TLYNTKKTTF YNVHAHVAEN 
101 IAKIAKQLNV ELMVHFSAMG IDNICNSDYA KSKLIGERLV KESFPDAVIV 
151 RPNLVFGPED KFFNKFARLL MILPFLPVVG GGKFVFQPVY VDDVAKLVFH 
201 IIDYKIKDKL YNVCGPSTYS FKELLNLILS ITHRKSKLFN ISFCLASILA 
251 FVFEIKIISI FSKLITGSTD PILTRDQVKF MMGMTQLHDM YPIDDLKEMG 
301 INFATVEDIV PQYLEIYKKS 
HITS AT: 21-29 

* *RELATED SEQUENCES AVAILABLE WITH SEQLINK** 

MF Unspecified 

CI MAN 

SR GenBank 
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LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents : BIOL (Biological study); PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 



L2 ANSWER 30 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 806840-45-9 REGISTRY 

CN GenBank BAD33177 (9CI) (CA INDEX NAME) 
OTHER NAMES: 



CN GenBank BAD33177 
FS PROTEIN SEQUENCE 
SQL 4 95 



(Translated from: GenBank AP004654) 



SEQ 1 MNCCSSEAVL SSKVALLNSL FCTIVLPKEN 

51 ICLMAQQGAC NVVLIANNTT LSFDDVEATF 

101 ACSPLRKKAA NGPVSPFALV IRGGCQFDDK 

151 GVLVSMAGSS SGIYIYAVFL SKASGEVLKK 

201 SIMAISFTSL LAMAAVLATC FFVRRHQIRR 

251 AMPSLIFTKV QEDNSTSSSC AICLEDYSFG 

301 LTSWKTFCPV CKRDASAGTS KPPASESTPL 

351 VAVSPPRPIR RHPSSQSTSR AYSISSAPRN 



. CSNKMNCTKG 
TPEVKDSGVN 
VRNAQNAGFK 
YSGQSDVEVW 
DRGRIPVTRE 
EKLRVLPCRH 
LSSVIHLSAE 
YNLQRYYTNS 



GGFPLLFCAV 
GAIYAVEPLD 
AVIVYDDEDS 
ILPVYENSAW 
FHGMSSQLVK 
KFHATCVDMW 
STALSSFRST 
PYISTSRSNV 



401 DLANMSSQWS HTPHQASMHS LRSGHLSLPI NIRYTIPHVS RSDYGSASLG 
451 LSHDSCSHHG SPSYYHSSLG QQRS YLMHRT ESGPSLSTMV LQSPQ 
HITS AT: 385-393 



**RELATED SEQUENCES AVAILABLE WITH SEQLINK** 

MF Unspecified 

CI MAN 

SR GenBank 



L2 ANSWER 31 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 

RN 793506-19-1 REGISTRY 

CN Protein (Azoarcus strain EbNl 1312-amino acid) (9CI) (CA INDEX NAME) 

OTHER NAMES: 

CN GenBank CAI09090 

CN GenBank CAI09090 (Translated from: GenBank CR555306) 

FS PROTEIN SEQUENCE 

SQL 1312 



SEQ 1 MLPNERRGDV AIIAGREWPA KVHEQREQLI RRIRRDGFAQ TMEAVAYTWF 

51 NRFAALRYME LHDYLGHGHR VLSSATAGGL PDIVAHASDL AASRQLPGLG 
101 ADVVTELKLA GNKDGELYRL LLVAQCNALS AAMPFLFERI DDETELLLPD 
151 NLLRTDSVIA KLVAEIAEED WAEVEIIGWL YQFYISEKKD QVIGKVVKSE 
201 DIPAATQLFT PNWIVQYLVQ NSVGRLWLMA NPASTLKAEW PYYIEPAEQT 
251 PEVQAQLDEL IRQRCLEGSG QWSVVSGQEG RDGDYALSGT DCLAKGNELG 
301 RAGVSGHQGV SERRGLWADE SSTSGSGIDS FQHRGRSGAE VTGGVSQFSV 
351 DRSGVEGRSG NADPDCRASQ VSDAGTGRTD TVTAGGDFED ARRTAGTSSL 
401 TTNHWPLTTP PLDPESITVL DPACGSGHIL VEAYALLKGI YLERGYRLRD 
4 51 IPRLILEKNL YGLDIDDRAA QLAGFALLMK ARADDRRLFN DPPRLNVLSL 
501 QESKGLDLDE LATHLASFGI QRGTIKALLD TFEHAKTFGS LIQIPDALNA 
551 QLPALAEALG KANETGDLYA. QAAAQDLLPL VEQAKVLGRK FDAVVANPPY 
601 MGGKGMNGAL KEFAKAAFPD SKSDLFAMFI ERGFAWCKPN GFNSMVTMQS 
651 WMFLSSFQSM REKLLEQRTI ETMAHLGARA FSEISGEVVQ TTAFVLQGQH 
701 FFGFKPTFFR LVDGQEADKE AALRSNQCRF DATAQDDFKK IPGSPVAYWV 
7 51 SRALFDVFAT LEPLQERAHV RKGLATCDDL RWVRLWHEVD YSLSNTQREP 
801 SSEEVGRLGR WFPLLKGGSF RKWYGNTAYL INWENDGRLL KASIVERYGG 
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851 GSYTKEVRSE DYYFRPCNTW SDITSGSFGA RYCPNGFITS 
901 DSDEYQFCAM LNSKPTSDWL RI INPTLHAN PGDIAQLPLP 
951 SRLGRAAVSL TRSDWDAYER SWDFQSFPLL TASADPDSTL 
1001 NHDTIVEMKR LEEENNRLFI DAYGLQDELT PDVPIEQITL 
1051 NLTEVEQWTR FRQDTMEELV SYAIGCMMGR YSLDEPGLIY 
1101 NRYKTFPADA DGIVPITDEF WFEDDAANRV REFLRAVWGA 
1151 AESLGTKANE TPDETIRRYL ADKFFKDHLQ TYKKRPI YWL 
1201 ALVYLHRYHE GTLARLRAEY VVPLTGKLQA RIDMLEKDAA 
1251 LNKQIEKLRK KHVELLAYDE KLRHYADMRI RLDLDEGVKV 
1301 EVKAVTGGAG DE 
HITS AT: 822-830 
MF Unspecified 
CI MAN 
SR GenBank 

LC STN Files: CA, CAPLUS, TOXCENTER 
DT.CA CAplus document type: Journal 
RL.NP Roles from non-patents: BIOL (Biolog 
1 REFERENCES IN FILE CA (1907 
1 REFERENCES IN FILE CAPLUS ( 



TVGGGIYSAS 
PHEVLTGVEI 
ESSYTAWVTQ 
TVNPAYRYGG 
AHAGNVGFDA 
DTLDENMAWL 
FSSGKQGAFQ 
AAPSTATRNK 
NYGKFGDLLA 



ical study); PRP (Properties) 

TO DATE) 
1907 TO DATE) 



L2 ANSWER 32 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 7 7 5509-64-3 REGISTRY 

CN Protein (Candida albicans clone WO2002086097-SEQID-15613) 

NAME) 
OTHER NAMES: 

CN 5037: PN: WO02086097 ,SEQID: 15612 claimed protein 
FS PROTEIN SEQUENCE 
SQL 429 



(9CI) (CA INDEX 



PATENT ANNOTATIONS (PNTE) 
Sequence | Patent 
Source | Reference 

Not Given | WO2002086097 
[claimed SEQID 
I 15612 



SEQ 1 MGCGASVPVD DDEIDLFLQD KRINDAIEQS LQLRQQNSKK GVKLLLLGAG 

51 ESGKSTVLKQ LKLLHKGGFT QQERRQYSHV IWCDVIQSMK VLIIQARKLK 

101 IKLDCDQPNN SLIPYKQIIL RSDPLKQIDA DVAGGTDFLN DFVVKYSEEN 

151 KNKRRLKSTG TTDIWGKDDD SNINSDAINQ ALESSLNKDS EQFTRLSIAE 

201 AIHKLWKLDS GIKKCFDRSN EFQLEGSADY YFDNVFNFAD TNYLSTDLDI 

251 LKGRIKTTGI TETDFLIKSF QFKVLDAGGQ RSERKKWIHC FEDITAVLFV 

301 LAISEYDQNL FEDERVNRMH ESIVLFDSLC NSKWFANTPF ILFLNKIDIF 



351 ENKIKKNPLK NYFPDYDGKP DDTNEAIKFF ETNFLKINQT NKPI YVHRTC 
401 ATDSKSMKFV LSAVTDMIVQ 1 QNLKKSGI I 
HITS AT: 333-341 

**RELATED SEQUENCES AVAILABLE WITH SEQLINK* * 
MF Unspecified 
CI MAN 
SR CA 

LC STN Files: CA, CAPLUS, TOXCENTER 
DT.CA CAplus document type: Patent 

RL.P Roles from patents :. BIOL (Biological study); PRP (Properties); USES 
(Uses) 

1 REFERENCES IN FILE CA (1907 TO DATE) 
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1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 



L2 ANSWER 33 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 736629-04-2 REGISTRY 

CN Protein (Oryza sativa clone PAT_MRT4530_76241C . 1 .pep fragment) (9CI) (CA 

INDEX NAME) 
OTHER NAMES : 

CN 3708: PN : US2004 0123343 SEQID: 178712 claimed protein 
FS PROTEIN SEQUENCE 
SQL 582 



PATENT ANNOTATIONS (PNTE) : 
Sequence | Patent 
Source | Reference 
=========+============= 

Not Given | US2004123343 
I claimed SEQID 
I 178712 



SEQ 1 MPLGQRAGDK SESRYCGVEV 

51 PDYRPTPGSV LPVAASDLVL 

101 EITLKQEIAW ASHLSLQACV 

151 RIPLEKSEPM DEDHDGAKDN 



LDFPAGEELP AVLSHSLSSS FDFLLAPLVD 
GPAQWSSHIV GKISEWIDLD AEDEQLRLDS 
LPPPKRSSCA NYARVVNHIL QGLTNLQLWL 
SDMVGYSLIR STLPSMNSLG RWFGEPAFLT 



201 NARGYPCLSK RHQKLLTGFF NHSVQVIISG 
251 DTAVRHALSP YLDYIAYIYQ RMDPLPEQER 
301 EAQTYETFEK DTVKYTQYQR AIAKALVDRV 
351 VYAVEKNPNA VITLHSLIKL EGWESLVTII 
401 LGSFGDNELS PECLDGAQRF LKPDGISIPS 
4 51 HKDIAHFETA YVVKLHRIAR LAPTQSVFTF 
501 IPQETGSCLV HGFAGYFDAV LYKDVHLGIE 
551 PIYVPSKTPI EVHFWRCCGA TKEKGEERER 

HITS AT: 191-199 

MF Unspecified 

CI MAN 

SR . CA 

LC STN Files: CA, CAPLUS 
DT.CA CAplus document type: Patent 
RL.P Roles from patents: BIOL (Biological 
(Uses) 

1 REFERENCES IN FILE CA (1907 
1 REFERENCES IN FILE CAPLUS ( 



RSNHNVSQGG 
FEINYRDFLQ 
SDDDVSTTKT 
SSDMRCWEAP 
SYTSFIEPIT 
DHPNPSPNAS 
PNTATPNMFS 
IE 



VLSGDENHTE 
SPLQPLMDNL 
AAEETGRKLK 
EKADILVSEL 
ASKLHNDIKA 
NQRYTKLKFE 
WFPIFFPLRK 



study); PRP (Properties); USES 

TO DATE) 
1907 TO DATE) 



L2 ANSWER 34 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 

RN 734459-09-7 REGISTRY 

CN Protein (plasmid pAgK84 204-amino acid) (9CI) (CA INDEX NAME) 

OTHER NAMES: 

CN GenBank AAS02132 

CN GenBank AAS02132 (Translated from: GenBank AY442931) 

FS PROTEIN SEQUENCE 

SQL 204 



SEQ 



1 MPLRRITFSS AKAGPVGRFS PRSIWERWPA VRLRYFANTA WLMCASSRRA 



51 LMSSAVSGAA VGAASANSRI VILRKSASSS VSFARISLAS SRISSLSRDG 

101 GLIFVVILHL VQNSSGKILF YTLWICEPLF GDCFQVFEII IGNVLTLALR 

151 KTCEKDGNVP CPKKQDRSIT AASPLPRASN PLFDEATTEI GVDQTLISSI 

201 RCLD 
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HITS AT: 34-42 
MF Unspecified 
CI MAN 
SR GenBank 

LC STN Files: > CA, CAPLUS, TOXCENTER 
DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 35 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 716772-46-2 REGISTRY 

CN Protein (Oryza sativa clone PAT_MRT4 530_314 57C . 1 . pep fragment) (9CI) (CA 

INDEX NAME) 
OTHER NAMES: 

CN 4175: PN : US20040123343 SEQID: 129175 claimed protein 
FS PROTEIN SEQUENCE 
SQL 514 
NTE 



type 


location 


description 


uncommon 


Aaa-4 66 





PATENT ANNOTATIONS (PNTE): 
Sequence I Patent 
Source I Reference 



Not Given|US2004123343 
I claimed SEQID 
1129175 



SEQ 1 MNCTKGGGFP LLFCAVICLM AQQGACNVVL IANNTTLSFD DVEATFTPEV 

51 KDSGVNGAIY AVEPLDACSP LRKKAANGPV SPFALVIRGG CQFDDKVRNA 

101 QNAGFKAVIV YDDEDSGVLV SSNFTVAGSS SGIYIYAVFL SKASGEVLKK 

151 YSGQSDVEVW ILPVYENSAW SIMAISFTSL LAMAAVLATC FFVRRHQIRR 

201 DRGRIPVTRE FHGMSSQLVK AMPSLIFTKV QEDNSTSSSC AICLEDYSFG 

251 EKLRVLPCRH KFHATCVDMW LTSWKTFCPV CKRDASAGTS KPPASESTPL 

301 LSSVIHLSAE STALSSFRST VAVSPPRPIR RHPSSQSTSR AYSISSAPRN 

351 YNLQRYYTNS PYISTSRSNV DLANMSSQWS HTPHQASMHS LRSGHLSLPI 



401 NIRYTIPHVS RSDYGSASLG LSHDSCSHHG SPSYYHSSLG QQRSYLMHRT 
451 ESGPSLSTMV LQSPQXVSSS TMANQEKSIS CGLTRSLRQA YVQHCLDSDG 
501 SLSAVTSDQS LPRC 

HITS AT: 355-363 

MF Unspecified 

CI MAN 

SR CA 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Patent 

RL.P Roles from patents: BIOL (Biological study); PRP (Properties); USES 
(Uses) 

1 REFERENCES -IN FILE CA (1907 TO DATE) 

1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 36 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 714158-75-5 REGISTRY 
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CN GenBank AAT75885 
OTHER NAMES: 
CN GenBank AAT75885 
FS PROTEIN SEQUENCE 
SQL 510 



(9CI) (CA INDEX NAME) 

(Translated from: GenBank AE017263) 



SEQ 1 MKHAKINSHF YDECKKVREA VGGIKNIELI NRCSTRIRIK VIDITKLNLN 

51 KLEGSKIFVK TLLKDDFVQL IVNENIDEVF LNLLKSLKIS YEEIPNDFEV 

101 RVNKEGNFFK VVTDGISVIV KPLIPLLITM AIVATLSNVF NGIDFGSGTL 

151 SETGQFAKAV GEMFDILQKA LNLAFSIMIP WSIFKLMKGS QAIGISIGIV 

201 LCFHGLISTN DIMSGEYGGI FKWFGDASFL ESGYPWKISY VGQILPIVAM 



EMVAIPMITI STFFIGILLV GPLGLLLTYG 
ILGLALPWLV ITGLIQVLVV INLQQFTTFG 
AVTIINRHNK ELKRTAIPSY SLAYVSGSTE 
ATVGIIITTF AGVICTNGNA SLLVFLSITT 
MAIAIVATFI TTFFATIGLS KIKVFKEMNA 
501 KVLERDFSVN 
HITS AT: 222-230 
MF Unspecified 
CI MAN 
SR GenBank 



251 SFIAVYIERF ANKFDIPVFK 

301 MN EG WW ATT DNVAKYIFNP 

351 GTTMMPMFTQ LNIAVATSIL 

401 PALFGVGLKF VYPVIAASIG 

451 KPEILDKFNI HTIAGGPYLW 



L2 ANSWER 37 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 704848-32-8 REGISTRY 

CN Protein ( Debaryomyces hansenii strain CBS767 632-amino acid) (9CI) 

INDEX NAME) 
OTHER NAMES: 
CN GenBank CAG908 92 

CN GenBank CAG90892 (Translated from: GenBank CR382139) 
FS PROTEIN SEQUENCE 
SQL 632 



SEQ 1 MPLMYFSGED SNDITFDEPA RNRLVLDPTL FARVQNDEEQ NVRVFLDTHL 

51 LRIRSRGGLV EHAPKKFVSK NHASIGVGVA PDLSALAYIT DAMINAEFRE 

101 MSSKEFKNSY LAPLETGIRG QKFSIWPLLR SYRYPFGNPQ RVENHIQYYF 

151 NGTICVPIND I YQDICGIHH YFGNASYLRP PLNKDIFIEP DIIHFAEYES 



201 GDGHFPEICF GLGDYKTENY YLAQGFEKFK EAIEDFRRMD QQTEYLYQDA 
251 HWNPGILFAL VLSKYFYDAF LCGTNRILTS NHQSFSGFFK YDIVEGQMSV 
301 DYHIISDPET FAHGITLRSA MAGFFYQTED DAIEIQNRLR KYVSIAHTAK 
351 KSDPLLNVRP KSLRDSSMRS FDTVSEAADK ENVHEIKDKI YGNTYCRVIY 
401 DSAKCYPSLS VYLPSTVFVK LYYYSSRLWR QNDLTCFGIP DRKGYYDMFF 
451 NELVINEEIA KSQFASNFPK LFASGYWNGL TDHPMHIFEY LGKEIPREQW 
501 DEKKVYEVIM LRLKELHLLR ISHNDILRSN IHVSESGKIT LIDFGLSKFP 
551 CSEESKQDDL ESLDNIFGVN SSTNKKEEDE QVNPDITKSQ VAVNDKGDTN 
601 HNDESDENTS SDFELAEMSF ESHDTKTTTT TK 

HITS AT: 170-178 

MF Unspecified 

CI MAN 

SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Propertie 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 



L2 ANSWER 38 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 704848-31-7 REGISTRY 
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CN Protein ( Debaryomyces hansenii strain CBS767 632-amino acid) (9CI) (CA 

INDEX NAME) 
OTHER NAMES: 
CN GenBank CAG90891 

CN GenBank CAG90891 (Translated from: GenBank CR382139) 
FS PROTEIN SEQUENCE 

SQL 632 . . 

SEQ 1 MPRMYFSGED SNDITLDEPA RNRLVLDPTL FARVQNDEEQ NVRVFLDTHL 

51 LRIRSRGGLV EHAPKKFVSK NHASIGVGMA PDLSALAYIT DAMINAGFRE 
101 MSSEEFKSSY LAPLETTIRK LKLRTRPLAI QYNYPLGNPQ HVEQNIQYYF 
151 NGTICVPIND IYQDICGVHH YFGNASYLRP PLNKDIFIEP DIIHFAEYEL 



201 EGGRFPDICF GLGDYKTENY YLAQGFEEFK VAIEDFRRMD QQTEYLYQDA 
251 HWNPGILFAL VLSKYFYDAF LCGTNRILIS NHQSFSGFFK YDIVEGQMSV 
301 DYHI ISDPET FAHGITLRSA MAGFFYQTED DAIEIQNRLR KYVSIAHTAK 
351 KSDPLLNVRP KSLRDSSMRS FDTVSEAADK ENVHEIKDKI YGNTYCRVIY 
401 DSAKCYPSLS VYLPSTVFVK LYYYSSRLWR QNDLTCFGIP DRKGYYDMFF 
451 NELVINEEIA KSQFASNFPK LFASGYWNGL TDHPMHIFEY LGKEIPREQW 
501 DEKKVYEVIM LRLKELHLLR ISHNDILRSN IHVSESGKIT LIDFGLSKFP 
551 CSEESKQDDL ESLDNIFGVN SSTNKKEEDE QVNPDITKSQ VAVNDKGDTN 
601 HNDESDENTS SDFELAEMSF ESHDTKTTTT TK 

HITS AT: 170-178 

MF Unspecified 

CI MAN 

SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 39 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN • 704721-47-1 REGISTRY 

CN Protein (Kluyveromyces lactis strain NRRL Y-1140 gene GBA1_KLULA) (9CI) 

(CA INDEX NAME) 
OTHER NAMES: 
CN GenBank CAG98 94 2 

CN GenBank CAG98942 (Translated from: GenBank CR382126) 
FS PROTEIN SEQUENCE 
SQL 447 

SEQ 1 MGCVASTGNY ENEDDPFIQN KRANDLIEQN LQQERNKNKN EVKLLLLGAG 

51 ESGKSTVLKQ MKLLHQGGFT H.RERMQYGQV IWADAIESMR TLILQAGKLG 
101 IELDSDLKNA HSGQLVNTEL HQCKEKI FRA NTLDQIDARM AGGSEFLNEY 
151 VLKYNGIGSK KKRQTTLGFK ESNGADPEEE DETDAFLSEK LAGTSYTGSS 
201 ETSELKRIDQ STNEEIAYAI KKLWTQDKGI RQCFNRSSEF QLEGSASYYF 
251 DNIEKFARVD YVCDDMDILK GRIKTTGITE NSFKIGPSTF KVYDAGGQRS 
301 ERRKWIHCFE GITAVVFVIA ISEYDQMLFE DERVNRMHES IVLLDTLLNS 
351 RWFANTPFIL FLNKVDIFQE KVKRSPIRTW FPNYPGKLGD SETGLKYFES 



401 LFLSLNRSNK PIYVHRTCAT DTQSMRFVLG AVTDLVIQQN LKKSGIL 
HITS AT: 351-359 
MF Unspecified 
CI MAN 
SR GenBank 

LC STN Files: CA, CAPLUS. 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
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1 REFERENCES IN FILE CA (1907 TO DATE) 

1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 40 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 696882-65-2 REGISTRY 

CN Protein (Candida albicans strain SC5314 clone US67 47 137-SEQID-1557 6 ) (9CI) 

(CA INDEX NAME) 
OTHER NAMES: 

CN 3576: PN: US6747137 SEQID: 15576 claimed protein 
FS PROTEIN SEQUENCE 
SQL 434 



PATENT ANNOTATIONS ( PNTE") : 
Sequence | Patent 
Source | Reference 



Not Given|US6747137 

I claimed SEQID . 
115576 



SEQ 1 YQYSIMGCGA SVPVDDDEID LFLQDKRIND 

51 LLGAGESGKS TVLKQLKLLH KGGFTQQERR 

101 ARKLKIKLDC DQPNNSLIPY KQIILRSDPL 

151 YSEENKNKRR LKSTGTTDIW GKDDDSNINS 

201 LSIAEAIHKL WKLDSGIKKC FDRSNEFQLE 

251 TDLDILKGRI KTTGITETDF LIKSFQFKVL 

301 AVLFVLAISE YDQNLFEDER VNRMHEYIVL 



AIEQSLQLRQ 
QYSHVIWCDV 
KQIDADVAGG 
DAINQALELS 
GSADYYFDNV 
DAGGQRSERK 
FDSLCNSKWF 



QNSKKGVKLL 
IQSMKVLIIQ 
TDELNDFVVK 
LNKDSEQFTR 
FNFADTNYLS 
KWIHCFEDIT 
ANTPFILFLN 



351 KIDIFENKIK KNPLKNYFPD YDGKPDDTNE AIKFFETNFL KINQTNKPI Y 

4 01 VHRTCATDSK SMKFVLSAVT DMIVQQNLKK SGIM 
HITS AT: 338-346 
MF Unspecified 
CI MAN 
SR CA 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Patent 

RL.P Roles from patents: BIOL (Biological study); PRP (Properties) /. USES 
(Uses) 

1 REFERENCES IN FILE CA (1907 TO DATE) 

1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 41 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 689706-15-8 REGISTRY 

CN LOC432185 protein (Xenopus laevis clone IMAGE : 4 960015 gene LOC432185) 

(9CI) (CA INDEX NAME) 
OTHER NAMES: 
CN GenBank AAH72295 

CN GenBank AAH72295 (Translated from: GenBank BC072295) 
FS PROTEIN SEQUENCE 
SQL 724 



SEQ 1 VSVKMAALSE HFTLCGLLTG TDDGKSEILG VEPAGEPDRV LVTDSVQAVT 

51 LYKVSDQKPQ GAWAVKQGQS ITCPAVLNPE SGEFI VVHDD KVLRIWKEDN 

101 VNLDIAFKAT LSADVCRIHT LPNTDPLVLF KGGAVHFLDS LLTDPQQKIG 

151 TVLSDGERI V WSEIFADDGQ PLIVYLTQQF SNYFVYIHKF SPVCVCKYHL 

201 KPNTEDSTIL DCSGSVKSKI FTLLTLYSSG QVCQTPFPVS LINKETERVV 

251 SASPLLQLSG PIEVGALNFL DESHVAVLIS SSSEQKECLS IWNTTFQTLQ 

301 AARNFQQRTS AQLWCYDNKL FVPHGKTLVV VPYVCEASCL ASVLGKSRNI 
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351 QTSVLENVPF VNWDKLVGKD PETKPSNAGA QKKTRERKTN ANAGNGTESI 
4 01 LYPFDVQNIS QTQTEAFVQQ LLLGKEDTDF QITVGKITQG LVKRCMADPK 

4 51 FYPQSSFVQL VQTNTLSYSL CPDLLSLFLE KRDVPLLQLC LHSFPDVPEV 

501 ILCSCLKAFL SISEKLVNAA QINTELASLY IDVGDKDKEH KYTEHPEEPS 
551 VLQNGFSPTA LEEDSCDELI AESLPQTTQK ATCPISIKRA VLVNSILISP 
601 YNESFLLPHL KDMSGDQVMF FLRYLLYLYL KFNENITINH PGKQMPTVSQ 
651 IVDWMSMLLD AHFATVVMLS DAKALLNKIQ KTVKSQLKFY SEMNKIEGCL 
701 AELKELKCPA RVSARYSIEV LQLY 

HITS AT: 450-458 

MF Unspecified 

CI ' MAN 

SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus* document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
2 REFERENCES IN FILE CA (1907 TO DATE) 
2 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 42 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 679247-64-4 REGISTRY 

CN Protein (Listeria monocytogenes strain 4b F2365 372-amino acid) (9CI) (CA 

INDEX NAME) ' 
OTHER NAMES: 
CN GenBank AAT02809 

CN GenBank AAT02809 (Translated from: GenBank AE017322) 
FS PROTEIN SEQUENCE 
SQL 372 

SEQ 1 MKSRKKGIIL VLSIIIIFSI GLLVNNIMTN NKDTAKPKKK TVAAVKKKKE 

51 TPPKPKEPFN IDFTGDIMFD WDLRPVLAEK GMDYPFNNVR EELKSSDYTF 
101 VDLETAITTR TKKVPYQEFW IKSDPSSLTA LKNAGVDMVN ISNNHILDYY 
151 EDGLLDTTAA LRANNLAYVG AGKNEDEAYQ LKVADIKGNK VGFMSFCHFF 

201 PNTGWIADED TPGVTNGYDI NLVEEKIKEE RAKNKDIDYM VVYFHWGVEK 



251 TNTPVDYQTQ YVKKLVDDHL VDAIVASHPH WLQSFEVYKD VPIAYSLGNF 
301 LFPDYVSGHS AETGIYKLNF NQGKVTAHFD PGIISGNQIN MLDGSAKTAQ 
351 LNYLQSISPN ATINSNGDIS AK 

HITS AT: 198-206 

MF Unspecified 

CI MAN 

SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP. (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 43 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 674849-80-0 REGISTRY 

CN Protein DITHP (diagnostic and therapeutic protein) (human Incyte clone 

1152478 . PT22p) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 2940: PN : WO2004023973 SEQID: 2940 claimed protein 
FS PROTEIN SEQUENCE 
SQL 317 



Page 25 



02/04/2008 



PATENT ANNOTATIONS (PNTE) 
Sequence I Patent 
Source | Reference 

Not Given|WO2004023973 
I claimed SEQID 
12940 



SEQ 



1 MGTRLLFWVA FCLLGADHTG AGVSQSPSNK VTEKGKDVEL RCDPISGHTA 
51 LYWYRQSLGQ GLEFLI YFQG NSAPDKSGLP SDRFSAERTG GSVSTLTIQR 
101 TQQEDSAVYL CAS SLAT AWH NHYFGEGSWL TVVGDPLPFP EDLNKVFPPE 



151 VAVFEPSEAE ISHTQKATLV CLATGFFPDH VELSWWVNGK EVHSGVSTDP 
201 QPLKEQPALN DSRYCLSSRL RVSATFWQNP RNHFRCQVQF YGLSENDEWT 
2 51 QDRAKPVTQI VSAEAWGRAD CGFTSVSYQQ GVLSATILYE ILLGKATLYA 
301 VLVSALVLMA MVKRKDF 

HITS AT: 122-130 

MF Unspecified 

CI MAN 

SR CA 

LC STN Files: CA, CAPLUS, TOXCENTER 
DT.CA CAplus document type: Patent 

RL.P Roles from patents: ANST (Analytical study); BIOL (Biological study); 
PRP (Properties); USES (Uses) 

1 REFERENCES IN FILE CA (1907 TO DATE) 

1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 4 4 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 668086-58-6 REGISTRY 

CN Protein (mouse strain C57BL/6J clone MGC: 76451 IMAGE : 30432960 gene 

1700021K19Rik) (9CI) (CA INDEX NAME) 
OTHER NAMES: 
CN GenBank AAH67390 

CN GenBank AAH67390 (Translated from: GenBank BC067390) 
FS PROTEIN SEQUENCE 
SQL 941 



SEQ 



1 MRPEGAGMDL GGGDGERLLE KSRREHWQLL GNLKTTVEGL VSANCPNVWS 

51 KYGGLERLCR DMQNILYHGL IHDQVCCRQA DYWQFVKGIR WLSPHSALHV 

101 EKFISLHESD QSDTDSVSER AVAELWLQHS LQCHCLSAQL RPLLGDRQYI 

151 RKFYTETAFL LSDAHVTAML QCLEAVEQNN PRLLAQIDAS MFARKQESPL 



201 LVTKSQSLTA 

251 FSLSGPSWQP 

301 LTSSLEASWV 

351 SSSHLFSSSS 

401 SGTKKSHIRS 

4 51 QDFGSCADLE 

501 EIQELKQKIR 

551 SESGSAEDAD 

601 GLLKQFEGMQ 

651 KLRIRVRGNL 

701 YIKRLRYCEY 

751 LKIWNDPLFN 

801 LLDSFDVVPG 

851 MLCQAKGFIC 

901 CERLQARREL 

HITS AT: 152-160 



LPGSTYTPPA 
QEDRECLSPA 
SSQNDSPSDV 
SQKLESAASS 
HSDTNIASRG 
KENAHFSISE 
LRRQQIRTKN 
DLEIQDADIR 
LPTASELEWL 
EWAPPRPQII 
LGKYFCQCCH 
VQDINSALYR 
HLTEDLHLYS 
EFCQNEEDVI 
LAKQSLESYL 



SYAQHSYFGS 
ETQTTPAPLP 
SEGPEYLAIG 
LGDQEEGRLS 
AAEGGQYLCS 
SLIAAIELMK 
LLPAYRETEN 
RSAVSNGKSS 
VPEHDAPQKL 
FNVHPAPTRK 
ENAQMVVPSR 
KVKLLNQVRL 
LSDLTATKKG 
FPFELHKCRT 
SDYEEEPTEA 



SSSLQSMPQS 
SDSTLAQDSP 
NPAPHGRTAS 
QAGSVLRRSS 
GEGMFRRPSE 
CNMMSQCLEE 
GSFRVTSSSS 
FSQNLSHCFL 
LPIPDSLPIS 
IAVAKQNYRC 
ILRKWDFSKY 
LRVQLYHMKN 
ELGPRLAELT 
CEECKACYHK 
LALEATVLET 



SHSSERRSTS 
LTAQEMSDST 
CESHSSNGES 
FSEGQTAPVA 
GQSLISYLSE 
EEVEEEDSDR 
QFSSRDSTQL 
HSTSAEAVAM 
PDDGQHADIY 
AGCGIRTDPD 
YVSNFSKDLL 
MFKTCRLAKE 
RAGAAHVERR 
TCFKSGRCPR 
T 
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MF Unspecified 

CI MAN 

SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 45 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 666934-61-8 REGISTRY 

CN Transcription-associated protein (Glycine max clone 

PAT_MRT3847_135535C.l.pep fragment) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 2195: PN : US20040031072 SEQID: 182195 claimed protein 
FS PROTEIN SEQUENCE 
SQL 120 

PATENT ANNOTATIONS (PNTE) : 
Sequence | Patent 
Source | Reference 
=========+============= 

Not Given|US2004031072 
I claimed SEQID 
1182195 



SEQ 



1 MFNNSPLVMA SRRFQQKWGN CHSAYIMLCV LGINPLNEHS YIYHFCRDTI 
51 LTTI YYVYHN PKFVKAQLHW YPETTYIYL'Q NPVVARPRLT RLCVYKQRRL 



101 KKKSGINALS RCAVWASNNH 
HITS AT: 69-77 
MF Unspecified 
CI MAN 
SR CA 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Patent 

RL.P Roles from patents: BIOL (Biological study); PRP (Properties); USES 
(Uses) 

1 REFERENCES IN FILE CA (1907 TO DATE) 

1 REFERENCES IN FILE CAPLUS (1907 TO DATE)/ 

L2 ANSWER 46 OF 82 REGISTRY . COPYRIGHT 2008 ACS on STN 

RN 666559-25-7 REGISTRY 

CN GenBank BAD03310 (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN GenBank BAD03310 (Translated from: GenBank AP004654) 

FS PROTEIN SEQUENCE 

SQL 4 95 



SEQ 1 MNCCSSEAVL SSKVALLNSL FCTIVLPKEN CSNKMNCTKG GGFPLLFCAV 

51 ICLMAQQGAC NVVLIANNTT LSFDDVEATF TPEVKDSGVN GAIYAVEPLD' 

101 ACSPLRKKAA NGPVSPFALV IRGGCQFDDK VRNAQNAGFK AVIVYDDEDS 

151 GVLVSMAGSS SGIYIYAVFL SKASGEVLKK YSGQSDVEVW ILPVYENSAW 

201 SIMAISFTSL LAMAAVLATC FFVRRHQIRR DRGRIPVTRE FHGMSSQLVK 

251 AMPSLIFTKV QEDNSTSSSC AICLEDYSFG EKLRVLPCRH KFHATCVDMW 

301 LTSWKTFCPV CKRDASAGTS KPPASESTPL LSSVIHLSAE STALSSFRST 

351 VAVSPPRPIR RHPSSQSTSR AYSISSAPRN YNLQRYYTNS PYISTSRSNV 
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401 DLANMSSQWS HTPHQASMHS LRSGHLSLPI NIRYTIPHVS RSDYGSASLG 
451 LSHDSCSHHG SPSYYHSSLG QQRSYLMHRT ESGPSLSTMV LQSPQ 
HITS AT: 385-393 

* *RELATED SEQUENCES AVAILABLE WITH SEQLINK** 

MF Unspecified 

CI MAN 

SR GenBank 

L2 ANSWER 47 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 644705-44-2 REGISTRY 

CN GenBank CAF19509 (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN GenBank CAF19509 (Translated from: GenBank BX927150) 
FS PROTEIN SEQUENCE 
SQL 300 

SEQ 1 MAFGYVLREA VRGMGRNVTM TIALIITTSI SLALLATGFL VTNMTDRTKD ' 

51 IYLDRVEVMI QLDEDTSAND PECTAESCTE VRDVLEGLDG IDSITYRSRE 
101 ASYERFVEVF KDTDPVLVAE TSPDALPAAF HVRLEDPLAV EILDPVRDLP 
151 QVSNVIDQVD DLRGATENLD SIRNATFLIA AVQVLAS I FL IANMVQIAAF 
201 NRREETEIMR IVGASRFYTQ GPFVFEAILS TLIGAVFAVG ALFLGKELVI 



251 DKALRGLYDS QLIAPVTTTD IWLVAPIISG IGVVIAGIIA QLTLRFYVRK 
HITS AT: 216-224 



* *RELATED SEQUENCES AVAILABLE WITH • SEQLINK** 

MF Unspecified 

CI • MAN 

SR GenBank 



L2 ANSWER 48 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 638183-33-2 REGISTRY 

CN L-Leucine, L-arginyl-L-tryptophyl-L-phenylalanyl-L-prolyl-L-asparaginyl-L- 

alanyl-L-prolyl-L-tyrosyl- (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 40: PN: WO03106682 SEQID: 54 unclaimed, sequence 
FS . PROTEIN SEQUENCE; STEREOSEARCH 
SQL 9 

PATENT ANNOTATIONS (PNTE) : 
Sequence | Patent 
Source I Reference 



Not Given|WO2003106682 
I unclaimed. 
I SEQID 54 



• SEQ 1 RWFPNAPYL 



HITS AT: 1-9 

MF C58 H78 N14 012 

SR CA 

LC STN Files: CA, CAPLUS, TOXCENTER, US PAT FULL 

DT.CA CAplus document type: Patent 

RL.P 1 Roles from patents: PRP (Properties) 



Absolute stereochemistry. 
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PAGE 1-A 

NH 




Ph 



PAGE 1-B 




**PROPERTY DATA AVAILABLE IN THE 'PROP' FORMAT** 

1 REFERENCES IN FILE CA (1907 TO DATE) 

1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 4 9 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN - 638183-30-9 REGISTRY 

CN L-Leucine, L-arginyl-L-phenylalanyl-L-phenylalanyl-L-prolyl-L-asparaginyl- 

L-alanyl-L-prolyl-L-tyrosyl- (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 39: PN: WO03106682 SEQID: 53 unclaimed sequence 
FS PROTEIN SEQUENCE; STEREOSEARCH 
SQL 9 

PATENT ANNOTATIONS (PNTE) : 
Sequence I Patent 
Source I Reference '. 



Not Given|WO2003106682 
. I unclaimed 
I SEQID 53 



SEQ 1 RFFPNAPYL 
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HITS AT: 1-9 

MF C56 H77 N13 012 

SR CA 

LC STN Files: CA, CAPLUS, TOXCENTER, USPATFULL 

DT . CA CAplus document type: Patent 

RL.P Roles from patents: PRP (Properties) 

Absolute stereochemistry. 




** PROPERTY DATA AVAILABLE IN THE 'PROP' FORMAT** 

1 REFERENCES IN FILE CA (1907 TO DATE) 

1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 50 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 638183-01-4 REGISTRY 

CN L-Methionine, L-arginyl-L- tyros yl-L-phenyla lanyl-L-prolyl -L-asparaginyl-L- 

alanyl-L-prolyl-L-tyrosyl- (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 14: PN: WO03106682 SEQID: 15 unclaimed sequence 
FS PROTEIN SEQUENCE; STEREOSEARCH 
SQL 9 

PATENT ANNOTATIONS (PNTE) : 
Sequence | Patent 
Source (Reference 



Not Given|WO2003106682 
I unclaimed 
I SEQID 15 

SEQ 1 RYFPNAPYM 



HITS AT: 1-9 

MF C55 H75 N13 013 S 

SR CA 
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LC STN Files: CA, CAPLUS, TOXCENTER, USPATFULL 

DT.CA CAplus document type: Patent 

RL.P Roles from patents: PRP (Properties) 

Absolute stereochemistry. 



PAGE 1-A 



HO' 




^\ 0 
0 NH 



MeS 




OH 




(CH 2 )3 



H 



,NH 2 



NH 



PAGE 1-B 



**PROPERTY DATA AVAILABLE IN THE 'PROP' FORMAT** 

1 REFERENCES IN FILE CA (1907 TO DATE) 

1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 



L2 ANSWER 51 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 638183-00-3 REGISTRY 

CN L-Isoleucine, L-arginyl-L-tyrosyl-L-phenylalanyl-L-prolyl-L-asparaginyl-L- 

alanyl-L-prolyl-L-tyrosyl- (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 13: PN: WO03106682 SEQID: 14 unclaimed sequence 
FS PROTEIN SEQUENCE; STEREOSEARCH 
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SQL 9 

PATENT ANNOTATIONS (PNTE) : 
Sequence | Patent 
Source | Reference 



Not Given|WO2003106682 
I unclaimed 
ISEQID 14 



SEQ 1 RYFPNAPYI 



HITS AT: 1-9 

MF C56 H77 N13 013 

SR CA 

LC STN Files: CA, CAPLUS, TOXCENTER, US PAT FULL 

DT.CA CAplus document type: Patent 

RL.P Roles from patents: PRP (Properties) 

Absolute stereochemistry. 

PAGE 1-A 
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PAGE 1-B 



OH 




NH 



**PROPERTY DATA AVAILABLE IN THE 1 PROP 1 FORMAT * * 

1 REFERENCES IN FILE CA (1907 TO DATE) 

1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 52 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 638182-89-5 REGISTRY 

CN L- Leucine, L-arginyl-L-tyrosyl— L-phenylalanyl-L-prolyl-L-asparaginyl-L- 

alanyl-L-prolyl-L-tyrosyl- (CA INDEX NAME) 
OTHER NAMES: 

CN 1: PN: WO03106682 SEQID: 2 claimed sequence 

CN 36: PN: WO2005045027 SEQID: 36 unclaimed protein 

CN 3: PN: WO20050011I7 SEQID: 11 unclaimed protein 

FS PROTEIN SEQUENCE; STEREOSEARCH 

SQL 9 

PATENT ANNOTATIONS (PNTE) : 
Sequence | Patent 
Source | Reference 



Not Given|WO2003106682 
| claimed SEQID 
12 



SEQ 1 RYFPNAPYL 



HITS AT: 1-9 

MF C56 H77 N13 013 

SR CA 

LC STN Files: ■ CA, CAPLUS, TOXCENTER, USPATFULL 
DT.CA CAplus document type: Patent 

RL.P Roles from patents: BIOL (Biological study); PRP (Properties); USES 
(Uses) 

Absolute stereochemistry. 
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PAGE 1-A 



HO' 





H02C' S ^Bu-i Me ° 




PAGE 1-B 




(CH 2 )3 




NH2 



NH 



**PROPERTY DATA AVAILABLE IN THE 'PROP' FORMAT** 

•4 REFERENCES IN FILE CA (1907 TO DATE) 

4 REFERENCES IN FILE CAPLUS (1907 TO DATE) 



L2 ANSWER 53 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 626129-45-1 REGISTRY 

CN Ribonuclease H (Bdellovibrio bacteriovorus strain HD100 gene rnhA) (9CI) 

(CA INDEX NAME) 
OTHER NAMES: 
CN GenBank CAE80890 

CN GenBank CAE80890 (TRANSLATED FROM: GenBank BX842654) 
FS PROTEIN SEQUENCE 
SQL 263 

SEQ 1 MTEKPRLVRM LGVSTNTRDH IMIYSDGACS GNPGPGGWGS VILYPDNQVQ 

51 ELGDGEKSTT NNRMEMTAAL EALKAVAHMK VPVRFYTDST YLIRGITQWV 
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101 HGWRRRGWKT AEGGEVSNQD IWEDLSHVVA ARGTQGKIEW HYSRGHVGIP 
151 GNERCDRIAV AFSKNDYVSL YSGSLDQYPY DLLQVPADTS LPEMKGPGEK 
201 KKEAFSYLSN LGGLVYRHRT WPACQKRVSG QSGAKFKKAT SAAEEIEILK 
251 SWGLSPSTVI KEG 

HITS AT: 84-92 

MF Unspecified 

CI MAN 

SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 54 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 615219-79-9 REGISTRY 

CN Cell division protein (Corynebacterium diphtheriae strain NCTC13129) (9CI) 

(CA INDEX NAME) 
OTHER NAMES: 
CN GenBank CAE49271 

CN GenBank CAE49271 (Translated from: GenBank BX248356) 
FS PROTEIN SEQUENCE 
SQL 300 



SEQ 1 MKLGFVFREA FRGLGRNITM TIALIITTAI SLALLATGFL VTNMTKETKD 

51 I YLDRVEVMV QLNENISAND KDCSSQACRD VRDKLDGADG IETVTYRSRQ 

101 QSYDRFVEVF KDTDPQLVAE TSPDALPAAL HVRLEDPLDT KPLDQVRDME 

151 QVDTIVDQVD DLRGATDNLD AIRNSTFIFA AIQAIAAIFL IVNMVQIAAF 

201 NRREEISIMR MVGASRWYTQ APFVIEAMVA ALFGAILSGL ALFGGKVWVV 



251 DKTLKGLYDS QLIARVSNAD IWAVAPVVAV IGIIFAAITA QATLRWYVRK 
HITS AT: 216-224 
MF Unspecified 
CI MAN 
SR GenBank 

LC STN Files: CA, CAPLUS, TOXCENTER 
DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 55 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 613213-57-3 REGISTRY 

CN 1700021K19Rik protein (mouse strain C57BL/6 clone MGC : 79154 IMAGE : 570 6123 ) 

(9CI) (CA INDEX NAME) 
OTHER NAMES: 
CN GenBank AAH60601 

CN • GenBank AAH60601 (Translated from: GenBank BC060601) 
FS PROTEIN SEQUENCE 
SQL 956 

SEQ 1 MRPEGAGMDL GGGDGERLLE KSRREHWQLL GNLKTTVEGL VSANCPNVWS 

51 KYGGLERLCR DMQNILYHGL IHDQVCCRQA DYWQFVKDIR WLSPHSALHV 
101 EKFISLHESD QSDTDSVSER AVAELWLQHS LQCHCLSAQL RPLLGDRQYI 
151 RKFYTETAFL LSDAHVTAML QCLEAVEQNN PRLLAQIDAS MFARKQESPL 



201 LVTKSQSLTA LPGSTYTPPA SYAQHSYFGS SSSLQSMPQS SHSSERRSTS 
251 FSLSGPSWQP QEDRECLSPA ETQTTPAPLP SDSTLAQDSP LTAQEMSDST 
301 LTSPLEASWV SSQNDSPSDV SEGPEYLAIG NPAPHGRTAS CESHSSNGES 
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351 SSSHLFSSSS SQKLESAASS LGDQEEGRQS QAGSVLRRSS FSEGQTAPVA 
401 SGTKKSHIRS HSDTNIASRG AAGGPRNITI IVEDPIAEGG QYLCSGEGMF 
4 51 RRPSEGQSLI SYLSEQDFGS CADLEKENAH FSISESLIAA IELMKCNMMS 
501 QCLEEEEVEE EDSDREIQEL KQKIRLRRQQ IRTKNLLPAY RETENGSFRV 
551 TSSSSQFSSR DSTQLSESGS AEDADDLEIQ DADIRRSAVS NGKSSFSQNL 
601 SHCFLHSTSA EAVAMGLLKQ FEGMQLPAAS ELEWLVPEHD , APQKLLPIPD 
651 SLPISPDDGQ HADI YKLRIR VRGNLEWAPP RPQIIFNVHP ' APTRKIAVAK 
701 QNYRCAGCGI RTDPDYIKRL RYCEYLGKYF CQCCHENAQM VVPSRILRKW 
751 DFSKYYVSNF SKDLLLKIWN DPLFNVQDIN SALYRKVKLL NQVRLLRVQL 
801 YHMKNMFKTC RLAKELLDSF DVVPGHLTED LHLYSLSDLT ATKKGELGPR 
851 LAELTRAGAA HVERCMLCQA KGFICEFCQN EEDVIFPFEL HKCRTCEECK 
901 ACYHKTCFKS GRCPRCERLQ ARRELLAKQS LESYLSDYEE EPTEALALEA 
951 TVLETT 

HITS AT: 152-160 

MF Unspecified 

CI MAN 

SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
2 REFERENCES IN FILE CA (1907 TO DATE) 
2 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 56 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 

RN 604834-47-1 REGISTRY 

CN Protein (Candida albicans gene CaYHR005C) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 2532: PN : US20030180953 SEQID: 7668 claimed protein 

FS PROTEIN SEQUENCE 

SQL 429 



PATENT ANNOTATIONS (PNTE) : 
Sequence I Patent 
Source I Reference 

Not Given|US2003180953 
| claimed SEQID 
17668 



SEQ 1 MGCGASVPVD DDEIDLFLQD KRINDAIEQS LQLRQQNSKK GVKLLLLGAG 

51 ESGKSTVLKQ LKLLHKGGFT QQERRQYSHV IWCDVIQSMK VLI IQARKLK 

101 IKLDCDQPNN SLIPYKQIIL RSDPLKQIDA DVAGGTDFLN DFVVKYSEEN 

151 KNKRRLKSTG TTDIWGKDDD SNINSDAINQ ALESSLNKDS EQFTRLSIAE 

201 AIHKLWKLDS GIKKCFDRSN EFQLEGSADY YFDNVFNFAD TNYLSTDLDI 

251 LKGRIKTTGI TETDFLIKSF QFKVLDAGGQ RSERKKWIHC FEDITAVLFV 

301 LAISEYDQNL FEDERVNRMH ESIVLFDSLC NSKWFANTPF ILFLNKIDIF 



351 ENKIKKNPLK NYFPDYDGKP DDTNEAIKFF ETNFLKINQT NKPIYVHRTC 
401 ATDSKSMKFV LSAVTDMIVQ QNLKKSGII 
HITS AT: 333-341 

**RELATED SEQUENCES AVAILABLE WITH SEQLINK** 
MF Unspecified 
CI MAN 
SR CA 

LC STN Files: CA, CAPLUS, USPATFULL 
DT.CA CAplus document type: Patent 

RL.P Roles from patents: BIOL (Biological study); PRP (Properties); USES 
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(Uses) 



1 REFERENCES IN FILE CA (1907 TO DATE) 

1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 



L2 ANSWER 57 OF 82 REGISTRY 
RN 593941-10-7 REGISTRY 
CN Gamma-glutamyltranspeptidase 

ggt) (9CI) (CA INDEX NAME)' 
OTHER NAMES: 
CN GenBank BAC90945 
CN GenBank BAC90945 
FS PROTEIN SEQUENCE 
SQL 618 



COPYRIGHT 2008 ACS on STN 



(Gloeobacter violaceus strain PCC 7421 gene 



(Translated from: GenBank AP006578) 



SEQ 



1 MRISANTDRI 

51 AAFVIGSAAP 

101 GVVEPQSSGI 

151 VIPRLSLDGY 

201 LSDLVRGRIE 

251 WVNSGGDSFY 

301 GVPVVTMPPP 

351 RSRFYGDPAF 



EAFCQDSGSV 
VAALDGVVST 
GGGGFALVWL 
RAAGVPGQAA- 
AAAERLGRDP 
RGETAAAMAA 
GSGGVLLEML 
VPVPLAALVA 



WNRRGVHRYV 
AHPLATEAGI 
GRTAEARVLD 
GLVKLQQAYA 
EASRVFLVEG 
AMTERGGLVS 
NTLENFDLAA 
KSYGAAQKKT 



TPGSEHFRDR 
EMLRKGGNAV 
FREAAPGQAA 
ALPLATLAEP 
KAPPAGYRLR 
RADLESYSPV 
QTPLLRTHLT 
ISLERATPST 



LLNHRTLPLL 
DAAIAASLAL 
AAMYLDKAGK 
AIRLAREGFS 
QPDLARTLEL 
WRTPLVGSYR 
AQAMSRAYAD 
QIVPGSLLSP 



401 ALLAGFQSWE GPRPVENTSH LSTMDADGNT VALTQTINGG FGAAVVAPKT 
4 51 GILLNNEMDD FAVAPGVPNL FGLVGSKANA IAPGKRPLSS MTPTILLKNG 
501 RPWIALGSPG GAFITNAVLQ TILGVLDYQL SLPEAVNAPR IHHQWQPDRL 
551 FVEKAYPVAQ ETLSALGYGL RAIDAMGNVQ AVQYAGRFVG ASDGRREGTS 
601 AIWQTPARAA SPAASPRR 

HITS AT: 353-361 

MF Unspecified 

CI MAN 

SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 



L2 ANSWER 58 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 593928-81-5 REGISTRY 

CN Protein (Gloeobacter violaceus strain PCC 7421 gene glll779) 

INDEX NAME) 
OTHER NAMES: 
CN GenBank BAC89720 

CN GenBank BAC89720 (Translated from: GenBank AP006574) 
FS PROTEIN SEQUENCE 
SQL 356 



SEQ 



1 MKVGILSFHH 

51 RELLPSRPVL 

101 VVGSDQVWST 

151 NVQSLVRQFQ 

201 LAI DSPFLLI 

251 GIGPEEWVRY 



TSNYGATLQN 
SNLAKILKMK 
DGWRGYDPSF 
AISVRDANTA 
YKMGPMSSSE 
YSQAAYVFTD 



YALWKTIDNL 
QFRESRIQLS 
FLDFI DGETT 
RVIKEECALD 
ESFILRLAGE 
SYHGSIFSII 



GFKAEVIDYQ 
EGVYFSSEGL 
LKISYAASAG 
SIQVLDPTFL 
RNLKIVSVGY 
FHKPFNVFVC 



PFVAMKNYYV 
KRFKRCYDAV 
HTETLGAHRS 
VGYGEFLGKA 
HHRIAQYNLA 
KTKAAKTGDL 



301 LNKLGLIERI FDGDKTNTSN IDYAPVQERL TRRVQASRDF LIRALSSRLA 

351 TAKSSG 
HITS AT: 259-267 
MF Unspecified 



(9CI) (CA 
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CI MAN 

SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 59 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 588638-49-7 REGISTRY 

CN 1700021K19Rik protein (mouse strain C57BL/6 clone IMAGE : 5704785 gene 

1700021K19Rik) (9CI) (CA INDEX NAME) 
OTHER NAMES: 
CN GenBank AAH57307 

CN GenBank AAH57307 (Translated from: GenBank BC057307) 
FS PROTEIN SEQUENCE 
SQL 992 

SEQ 1 KLTSHGASPL LRVLRVPWRS KRPGASLWPR DGLIVRMRPE GAGMDLGGGD 

51 GERLLEKSRR EHWQLLGNLK TTVEGLVSAN CPNVWSKYGG LERLCRDMQN 
101 ILYHGLIHDQ VCCRQADYWQ FVKDIRWLSP HSALHVEKFI SLHESDQSDT 
151 DSVSERAVAE LWLQHSLQCH CLSAQLRPLL GDRQYIRKFY TETAFLLSDA 



201 HVTAMLQCLE AVEQNNPRLL AQIDASMFAR KQESPLLVTK SQSLTALPGS 
251 TYTPPASYAQ HSYFGSSSSL QSMPQSSHSS ERRSTSFSLS GPSWQPQEDR 
301 ECLSPAETQT TPAPLPSDST LAQDSPLTAQ EMSDSTLTSP LEASWVSSQN 
351 DSPSDVSEGP EYLAIGNPAP HGRTASCESH SSNGESSSSH LFSSSSSQKL 
401 ESAASSLGDQ EEGRQSQAGS VLRRSSFSEG QTAPVASGTK KSHIRSHSDT 
451 NIASRGAAGG PRNITIIVED PIAEGGQYLC SGEGMFRRPS EGQSLISYLS 
501 EQDFGSCADL EKENAHFSIS ESLIAAIELM KCNMMSQCLE EEEVEEEDSD 
551 REIQELKQKI RLRRQQIRTK NLLPAYRETE NGSFRVTSSS SQFSSRDSTQ 
601 LSESGSAEDA DDLEIQDADI RRSAVSNGKS SFSQNLSHCF LHSTSAEAVA 
651 MGLLKQFEGM QLPAASELEW LVPEHDAPQK LLPIPDSLPI SPDDGQHADI 
7 01 YKLRIRVRGN LEWAPPRPQI IFNVHPAPTR KIAVAKQNYR CAGCGIRTDP 
751 DYIKRLRYCE YLGKYFCQCC HENAQMVVPS RILRKWDFSK YYVSNFSKDL 
801 LLKIWNDPLF NVQDINSALY RKVKLLNQVR LLRVQLYHMK NMFKTCRLAK 
851 ELLDSFDVVP GHLTEDLHLY SLSDLTATKK GELGPRLAEL TRAGAAHVER 
901 CMLCQAKGFI CEFCQNEEDV IFPFELHKCR TCEECKACYH KTCFKSGRCP 
951 RCERLQARRE LLAKQSLESY LSDYEEEPTE ALALEATVLE TT 

HITS AT: 188-196 

MF Unspecified 

CI MAN 

SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 



L2 ANSWER 60 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 571229-35-1 REGISTRY 

CN Membrane protein (Corynebacterium efficiens strain YS-314) (9CI) (CA 

INDEX NAME) 
OTHER NAMES: 
CN GenBank BAC17623 

CN GenBank BAC17623 (Translated from: GenBank AP005216) 

FS PROTEIN SEQUENCE \ 
SQL 300 
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SEQ 1 MSVTYVLRES LRGMARNLTM TLALVITTSI SLALLATGFL VANMTERTKD 

51 IYLDRVEVMI QLDEDTSAND PDCVEEACAE VRSELEALDG IDSITYRSRE 

101 QSYERFVEVF EYTDPVLVAE TSPDALPAAF HVRLADPLAV DILDPVRELP 

151 QVAAVIDQVD DLRGATDNLD SIRNATFLVA AVQILASTFL IANMVQIAAY 

201 SRREETEIMR IVGASRWYSE APFVLEAVLS TLIGAVLAVV GLFLGKELVI 



251 DKALRGLYES QLIAPITNAD IWTVAPVVSV IGVVVAGLIA QLTLRFYVRK 
HITS AT: 216-224 
MF Unspecified 
CI MAN 
SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP '(Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS # (1907 TO DATE) 

L2 ANSWER 61 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 570456-75-6 REGISTRY 

CN Protein (Proteus mirabilis strain ATCC202157 clone US6605709-SEQID-6558 

open reading frame) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 2386: PN : US6605709 SEQID: 6558 claimed protein 
FS PROTEIN SEQUENCE 
SQL 264 

PATENT ANNOTATIONS (PNTE) : 
Sequence | Patent 
Source | Reference 

= = ======= + = = = = = = = ==:==== 

Not Given|US6605709 

I claimed SEQID 
I 6558 



SEQ 



1 EVIMDRKLMP ALFVGHGSPM NVLEDNKYTR LWTTLGETLP KPKAILVISA 
51 HWYTQGTYIT AMTHPKTIHD FYGFPPELYQ IEYPAKGSIG LVALIEDLID 



101 PMKLKLDMEQ WGFDHGSWGI LEKMYPNANI PVVQLSIDAN QSPQWHYEFG 
151 KKLVELRREG VLVIGSGNIV HNLRMMDWQN DQAEPYSWAL SFSETVERCL 
201 QSDKVPEALF TILSTQEGQL AHPTAEHFLP ALYLLGLKQP DEKVTLLNND ■ 
251 IINKSLSMMT FQIG 

HITS AT: 51-59 

MF Unspecified 

CI MAN 

SR CA 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Patent 

RL.P Roles from patents: BIOL (Biological study); PRP (Properties) 
(Uses) 

1 REFERENCES IN FILE CA (1907 TO DATE) 

1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 



USES 



L2 ANSWER 62 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 570441-91-7 REGISTRY 

CN Protein (Proteus mirabilis strain ATCC202157 clone US66057 09-SEQID-50 68 

open reading frame) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 896: PN : US6605709 SEQID: 5068 claimed protein 
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FS PROTEIN SEQUENCE 
SQL 493 

PATENT ANNOTATIONS (PNTE): 
Sequence | Patent 
Source | Reference 



Not Given|US6605709 

I claimed SEQID 
15068 



SEQ 1 VFSPFGVLYL EIGVGAYFFF SQSQRGNNAM VMQHAKPGRV ALAVMGALLV 

51 TALPAKMSYA EGFIDDSTLT GGIYYWQRDR DRKELTPNSP DYGKYKANLH 

101 HSTFNANLDF SSGYLGDFFG • IDLAAFGAVE LNNSGPAAPN EIGFSDAKSR 

151 WDEKWTGDRS GVSI YKAAAK FKLGHFWAQG GYIQPKGQTL LRPHWSFLPG 



201 TYRGAEAGAV FDFDKNGELS FSYMWTDEYK APWYRNMYNF RKADLETNIS 
251 YLHSFGAKYD FKNSLVLEAA FGQADGYIDQ YFGKVSYDFP IANNTLKTSY 
301 QFYGAKDKVT GGGINDVYDG LAWLQAFTLG YTYNVFDFKV EGTWVKAEGN 
351 QGYFLQRMTP GWATSNGRLD IWWDGRSDFN ANGEKALYAG VMYDLKNWDL 
4 01 PGWAIGTSYV YAWDAKPSSN PIYDQSKRIR ESAWNADIMY TVQEGRAKGT 
451 LFKLHYTRYD NHSDIPSYEG GFGNIFQDEK DVKFNVIMPF TIF 

HITS AT: 175-183 

MF Unspecified 

CI MAN 

SR CA 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Patent 

RL.P Roles from patents: BIOL (Biological study); PRP (Properties); USES 
(Uses) 

1 REFERENCES IN FILE CA (1907 TO DATE) 

1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 63 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 568498-90-8 REGISTRY 

CN Glycosyltransferase ( Synechococcus strain WH8102 gene SYNW0416) (9CI) (CA 

INDEX NAME) 
OTHER NAMES: 
CN GenBank CAE06931 

CN GenBank CAE06931 (Translated from: GenBank BX569690) 
FS PROTEIN SEQUENCE 
SQL 661 

SEQ 1 MNGAGPNGQS MSILYVCHGH PRYAKGGGEL AAWRLFQAFE AEGAAFLAAA 

51 PSLAVMPPGC EVMSAGPRQW LIKPSLSPLQ HGTEVSLQDG GALHQALADL 
101 RPELVHLHHY VHVGIDLVHA LRRWFPQAGF LFTLHEYWGP CAYEGRLLRR 



151 SGELCAGPEP EVCVECVGPE QRVDLAIRQL RLQRMFAAID HLLSPSLFLK 

201 QRYEEWGLDP HRISVVENLP APAPASVESG ANAGSDALVL GYFGQVNPWK 

251 GLELILQAVQ RARRRCGQLS VQLHGCGPAD LDPSTSPYPE LAGRLAALVE 

301 QLGPEAVQLC GRYDSDQLAD RMAGVDAVLM ASTWFENAPM VIQEAFQHGR 

351 PVLAPRLGGM AEKVQHELSG LLFAPGDPAD LARTIERCID EVDLLPGLQA 

401 QVLRMPLKGA RSLEQHRRLY ARFRGSSAAA EPSLAEWEPA LLEDLRQNFE 

4 51 PLGANCELGF LMGRLGIQRS SLFRWLFTPL ASLEKVLAEG LERFFSEPVA 

501 VTDPVMAADM VVDQRTGIFF HAGELRQQLE QAGNNSAALV QVRAGEACRD 

551 QRGKYRYLVE KFLASVPVAS TLHVFSDFHR ELNEAAMLRI HSRLRDLGKP 

601 LGSTLLFVQP ADGSTEVNSV RCLGDGLQLA AIRRFAPGEH ADQIDLEAWI 

651 SLLSLSRQLS S 
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HITS AT: 123-131 
MF Unspecified 
CI MAN 
SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document ' type : Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 64 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 550280-77-8 REGISTRY 

CN Protein (megaplasmid pHGl gene PHG068) (9CI) (CA INDEX NAME) 

OTHER NAMES: 

CN GenBank AAP85821 

CN GenBank AAP85821 (Translated from: GenBank AY305378) 
FS PROTEIN SEQUENCE 
SQL 215 

SEQ 1 MPDLTFSISA AEVAPFAAVP LLLFKLQIAN APAGEEIASI TLQCQIQIEA 

51 TRRRYAPAEH DGLQDLFGVP QRWTQTLRTM LWTHATVIVP AFSGTCVIDL 
101 LVPCSFDFNI AATKYCHALQ DGEI PLMLQF SGTVFYRNAD DALQAAPVPW 
151 HKEAAFRLPV AVWHDMMARY YPNGAWLCLQ REVFDRLGRY KAQQGLATWE 



201 RALDALLDGA EEKAS ■ 
HITS AT: 169-177 
MF Unspecified 
CI MAN 
SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 65 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN ,502676-38-2 REGISTRY 

CN MKIAA0226 protein (mouse clone mbg06042 gene mKIAA0226 C-terminal 

fragment) (9CI) (CA INDEX NAME) 
OTHER NAMES: 
CN GenBank BAC65503 

CN GenBank BAC65503 (Translated from: GenBank AK122221) 
FS PROTEIN SEQUENCE 
SQL 1003 



SEQ 1 LCRCQGVVHG SKLTSHGASP LLRVLRVPWR SKRPGASLWP RDGLIVRMRP 

51 EGAGMDLGGG DGERLLEKSR REHWQLLGNL KTTVEGLVSA NCPNVWSKYG 

101 GLERLCRDMQ NILYHGLIHD QVCCRQADYW QFVKDIRWLS PHSALHVEKF 

151 ISLHESDQSD TDSVSERAVA ELWLQHSLQC HCLSAQLRPL LGDRQYIRKF 



201 YTETAFLLSD AHVTAMLQCL EAVEQNNPRL LAQIDASMFA RKQESPLLVT 



251 KSQSLTALPG STYTPPASYA QHSYFGSSSS LQSMPQSSHS SERRSTSFSL 

301 SGPSWQPQED RECLSPAETQ TTPAPLPSDS TLAQDSPLTA QEMSDSTLTS 

351 PLEASWVSSQ NDSPSDVSEG PEYLAIGNPA PHGRTASCES HSSNGESSSS 

401 HLFSSSSSQK LESAASSLGD QEEGRQSQAG SVLRRSSFSE GQTAPVASGT 

451 KKSHIRSHSD TNIASRGAAG GPRNITIIVE DPIAEGGQYL CSGEGMFRRP 

501 SEGQSLISYL SEQDFGSCAD LEKENAHFSI SESLIAAIEL MKCNMMSQCL 

551 EEEEVEEEDS DREIQELKQK IRLRRQQIRT KNLLPAYRET ENGSFRVTSS 
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601 SSQFSSRDST QLSESGSAED ADDLEIQDAD IRRSAVSNGK SSFSQNLSHC 
651 FLHSTSAEAV AMGLLKQFEG MQLPAASELE WLVPEHDAPQ KLLPIPDSLP 
701 ISPDDGQHAD I YKLRIRVRG NLEWAPPRPQ IIFNVHPAPT RKIAVAKQNY 
751 RCAGCGIRTD PDYIKRLRYC EYLGKYFCQC CHENAQMVVP SRILRKWDFS 
801 KYYVSNFSKD LLLKIWNDPL FNVQDINSAL YRKVKLLNQV RLLRVQLYHM 
851 KNMFKTCRLA KELLDSFDVV PGHLTEDLHL YSLSDLTATK KGELGPRLAE 
901 LTRAGAAHVE RCMLCQAKGF ICEFCQNEED VIFPFELHKC RTCEECKACY 
951 HKTCFKSGRC- PRCERLQARR ELLAKQSLES YLSDYEEEPT EALALEATVL 
1001 ETT 

HITS AT: 199-207 

MF Unspecified 

CI MAN 

SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 66 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 493570-38-0 REGISTRY 

CN Protein (mouse strain C57BL/6J clone A630091D15 941-amino acid) (9CI) (CA 

INDEX NAME) 
OTHER NAMES: 
CN GenBank BAC31257 

CN GenBank BAC31257 (Translated from: GenBank AK042428) 

CN Protein (Mus musculus strain C57BL/6J clone A630091D15 941-amino acid) 
FS PROTEIN SEQUENCE 
SQL 941 

SEQ 1 MRPEGAGMDL GGGDGERLLE KSRREHWQLL GNLKTTVEGL VSANCPNVWS 

51 KYGGLERLCR DMQNILYHGL IHDQVCCRQA DYWQFVKDIR WLSPHSALHV 
101 EKFISLHESD QSDTDSVSER AVAELWLQHS LQCHCLSAQL RPLLGDRQYI 
151 RKFYTETAFL LSDAHVTAML QCLEAVEQNN PRLLAQIDAS MFARKQESPL 



201 LVTKSQSLTA LPGSTYTPPA SYAQHSYFGS SSSLQSMPQS SHSSERRSTS 
251 FSLSGPSWQP QEDRECLSPA ETQTTPAPLP SDSTLAQDSP LTAQEMSDST 
301 LTSPLEASWV SSQNDSPSDV SEGPEYLAIG NPAPHGRTAS CESHSSNGES . 
351 SSSHLFSSSS SQKLESAASS LGDQEEGRQS QAGSVLRRSS FSEGQTAPVA 
401 SGTKKSHIRS HSDTNIASRG AAEGGQYLCS GEGMFRRPSE GQSLISYLSE 
4 51 QDFGSCADLE KENAHFSISE SLIAAIELMK CNMMSQCLEE EEVEEEDSDR 
501 EIQELKQKIR LRRQQIRTKN LLPAYRETEN GSFRVTSSSS QFSSRDSTQL 
551 SESGSAEDAD DLEIQDADIR RSAVSNGKSS FSQNLSHCFL HSTSAEAVAM 
601 GLLKQFEGMQ LPAASELEWL VPEHDAPQKL LPIPDSLPIS PDDGQHADIY 
651 KLRIRVRGNL EWAPPRPQII FNVHPAPTRK IAVAKQNYRC AGCGIRTDPD 
701 YIKRLRYCEY LGKYFCQCCH ENAQMVVPSR ILRKWDFSKY YVSNFSKDLL 
7 51 LKIWNDPLFN VQDINSALYR KVKLLNQVRL LRVQLYHMKN MFKTCRLAKE 
801 LLDSFDVVPG HLTEDLHLYS LSDLTATKKG ELGPRLAELT RAGAAHVERC 
851 MLCQAKGFIC EFCQNEEDVI FPFELHKCRT CEECKACYHK TCFKSGRCPR 
901 CERLQARREL LAKQSLESYL SDYEEEPTEA LALEAPVLET T 

HITS AT: 152-160 

MF Unspecified 

CI MAN 

SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
2 REFERENCES IN FILE CA (1907 TO DATE) 
2 REFERENCES IN FILE CAPLUS (1907 TO DATE) 
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L2 ANSWER 67 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 

RN 489034-87-9 REGISTRY 

CN GenBank BAB98196 (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN GenBank BAB98196 (Translated from: GenBank AP005276) 

FS PROTEIN SEQUENCE 

SQL 300 



SEQ 1 MAFGYVLREA VRGMGRNVTM TIALIITTSI SLALLATGFL VTNMTDRTKD 

51 IYLDRVEVMI QLDEDTSAND PECTAESCTE VRDVLEGLDG IDSITYRSRE 

101 ASYERFVEVF KDTDPVLVAE TSPDALPAAF HVRLEDPLAV EILDPVRDLP 

151 QVSNVIDQVD DLRGATENLD SIRNATFLIA AVQVLASIFL IANMVQIAAF 

201 NRREETEIMR IVGASRFYTQ GPFVFEAILS TLIGAVFAVG ALFLGKELVI 

251 DKALRGLYDS QLIAPVTTTD IWLVAPIISG IGVVIAGIIA QLTLRFYVRK 

HITS AT: 216-224 



* *RELATED SEQUENCES AVAILABLE WITH SEQLINK** 

MF Unspecified 

CI MAN 

SR GenBank 



L2 ANSWER 68 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 486378-27-2 REGISTRY 

CN Protein (mouse strain C57BL/6J clone 5330403K09 941-amino acid) (9CI) (CA 

INDEX NAME) 
OTHER NAMES: 
CN GenBank BAC26925 

CN GenBank BAC26925 (Translated from: GenBank AK030368) 

CN Protein (Mus musculus strain C57BL/6J clone 5330403K09 941-amino acid) 
FS PROTEIN SEQUENCE 
SQL 941 

SEQ 1 MRPEGAGMDL GGGDGERLLE KSRREHWQLL GNLKTTVEGL VSANCPNVWS 

51 KYGGLERLCR DMQNILYHGL IHDQVCCRQA DYWQFVKDIR WLSPHSALHV 
101 EKFISLHESD QSDTDSVSER AVAELWLQHS LQCHCLSAQL RPLLGDRQYI 
151 RKFYTETAFL LSDAHVTAML QCLEAVEQNN PRLLAQIDAS MFARKQESPL 



201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
HITS AT: 
MF Unspe 
CI MAN 
SR GenBa 
LC STN F 



LVTKSQSLTA 
FSLSGPSWQP 
LTSPLEASWV 
SSSHLFSSSS 
SGTKKSHIRS 
QDFGSCADLE 
EIQELKQKIR 
SESGSAEDAD 
GLLKQFEGMQ 
KLRIRVRGNL 
YIKRLRYCEY 
LKIWNDPLFN 
LLDSFDVVPG 
MLCQAKGFIC 
CERLQARREL 
152-160 
cif ied 



LPGSTYTPPA 
QEDRECLSPA 
SSQNDSPSDV 
SQKLESAASS 
HSDTNIASRG 
KENAHFSISE 
LRRQQIRTKN 
DLEIQDADIR 
LPAASELEWL 
EWAPPRPQII 
LGKYFCQCCH 
VQDINSALYR 
HLTEDLHLYS 
EFCQNEEDVI 
LAKQSLESYL 



SYAQHSYFGS 
ETQTTPAPLP 
SEGPEYLAIG 
LGDQEEGRQS 
AAEGGQYLCS 
SLIAAIELMK 
LLPAYRETEN 
RSAVSNGKSS 
VPEHDAPQKL 
FNVHPAPTRK 
ENAQMVVPSR 
KVKLLNQVRL 
LSDLTATKKG 
FPFELHKCRT 
SDYEEEPTEA 



SSSLQSMPQS 
SDSTLAQDSP 
NPAPHGRTAS 
QAGSVLRRSS 
GEGMFRRPSE' 
CNMMSQCLEE 
GSFRVTSSSS 
FSQNLSHCFL 
LPIPDSLPIS 
IAVAKQNYRC 
ILRKWDFSKY 
LRVQLYHMKN 
ELGPRLAELT 
CEECKACYHK 
LALEATVLET 



SHSSERRSTS 
LTAQEMSDST 
CESHSSNGES 
FSEGQTAPVA 
GQSLISYLSE 
EEVEEEDSDR 
QFSSRDSTQL 
HSTSAEAVAM 
PDDGQHADIY 
AGCGIRTDPD 
YVSNFSKDLL 
MFKTCRLAKE 
RAGAAHVERC 
TCFKSGRCPR 
T 



nk 

iles : 



CA, CAPLUS 
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DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
2 REFERENCES IN FILE CA (1907 TO DATE) 
2 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 69 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 

RN 486066-28-8 REGISTRY 

CN GenBank AAA34324 (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN GenBank AAA34324 (Translated from: GenBank M88113) 

FS PROTEIN SEQUENCE 

SQL 429 



SEQ ■ 1 MGCGASVPVD DDEIDPFLQD KRINDAIEQS LQLRQQNSKK GVKLLLLGAG 

51 ESGKSTVLKQ LKLLHKGGFT QQERRQYSHV IWCDVIQSMK VLIIQARKLK 

101 IKLDCDQPNN SLIPYKQIIL RSDPLKQIDA SVAGGTDFLN DFVVKYSEEN 

151 KNKRRLKSTG TTDIWGKDDD SNINSDAINQ ALESSLNKDS EQFTRLSIAE 

201 AIHKLWKLDS GIKKCFDRSN EFQLEGSADY YFDNVVNFAD TNYLSTDLDI 

251 LKGRIKTTGI TETDFLIKSF QFKVLDAGGQ RSVRKKWIHC FEDITAVLFV 

301 LAISEYDQNL FEDERVNRMH ESIVLFDSLC NSKWFANTPF ILFLNKIDIF 



351 ENKIKKNPLK NYFPDYDGKP DDTNEAIKFF ETNFLKINQT NKPIYVHRTC 

4 01 ATDSKSMKFV LSAVTDMIVQ QNLKKSGIM 
HITS AT: 333-341 
MF Unspecified 
CI MAN 
SR GenBank 



L2 ANSWER 70 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 

RN 481411-39-6 REGISTRY 

CN GenBank CAD34 629 (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN GenBank CAD34629 (Translated from: GenBank AX404619) 

FS PROTEIN SEQUENCE 

SQL 300 



SEQ 1 MAFGYVLREA VRGMGRNVTM TIALIITTSI SLALLATGFL VTNMTDRTKD 

51 IYLDRVEVMI QLDEDTSAND PECTAESCTE VRDVLEGLDG IDSITYRSRE 

101 ASYERFVEVF KDTDPVLVAE TSPDALPAAF HVRLEDPLAV EILDPVRDLP 

151 QVSNVIDQVD DLRGATENLD SIRNATFLIA AVQVLASIFL IANMVQIAAF 

201 NRREETEIMR IVGASRFYTQ GPFVFEAILS TLIGAVFAVG ALFLGKELVI 



251 DKALRGLYDS QLIAPVTTTD IWLVAPIISG IGVVIAGIIA QLTLRFYVRK 
HITS AT: 216-224 

** RELATED SEQUENCES AVAILABLE WITH SEQLINK** 

MF Unspecified 

CI MAN 

SR GenBank 



L2 ANSWER 71 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 481150-69-0 REGISTRY 

CN KIAA0226 protein (human cell line KG-1 clone ha046331 gene KIAA0226) (9CI) 

(CA INDEX NAME) 
OTHER NAMES: 

CN 1305: PN: WO03091391 TABLE: 20 unclaimed protein 
CN 2200: PN: US2004 00094 8 1 TABLE: 1 claimed protein 
CN 4295: PN : WO03038130 FIGURE: 3 claimed protein 
CN 456: PN : WO03095618 TABLE: 1 claimed protein 
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CN 495: PN : WO2004038376 TABLE: 4 unclaimed protein 
CN 689: PN : WO2004038376 TABLE : 5 unclaimed protein 
CN GenBank BAA13215 

CN GenBank BAA13215 (Translated from: GenBank D86979) 
FS PROTEIN SEQUENCE 
SQL 973 

PATENT ANNOTATIONS (PNTE) : 
Sequence | Patent 
Source (Reference 



Not Given|WO2003038130 

I claimed FIGURE 
13 



SEQ 1 DLSFPREGRE HWQLLGNLKT TVEGLVSTNS PNVWSKYGGL ERLCRDMQSI 

51 LYHGLIRDQA CRRQTDYWQF VKDIRWLSPH SALHVEKFIS VHENDQSSAD 
101 GASERAVAEL WLQHSLQYHC LSAQLRPLLG DRQYIRKFYT DAAFLLSDAH 



151 VTAMLQCLEA VEQNNPRLLA QIDASMFARK HESPLLVTKS QSLTALPSST 
201 YTPPNSYAQH SYFGSFSSLH QSVPNNGSER RSTSFPLSGP PRKPQESRGH 
251 VSPAEDQTIQ APPVSVSALA RDSPLTPNEM" SSSTLTSPIE ASWVSSQNDS 
301 PGDASEGPEY LAIGNLDPRG RTASCQSHSS NAESSSSNLF SSSSSQKPDS 
351 AASSLGDQEG GGESQLSSVL RRSSFSEGQT LTVTSGAKKS HIRSHSDTSI 
401 ASRGAPGGPR NITIIVEDPI AESCNDKAKL RGPLPYSGQS SEVSTPSSLY 
4 51 MEYEGGRYLC SGEGMFRRPS EGQSLISYLS EQDFGSCADL EKENAHFSIS 
501 ESLIAAIELM KCNMMSQCLE EEEVEEEDSD REIQELKQKI RLRRQQIRTK 
551 NLLPMYQEAE HGSFRVTSSS SQFSSRDSAQ LSDSGSADEV DEFEIQDADI 
601 RRNTASSSKS FVSSQSFSHC FLHSTSAEAV AMGLLKQFEG MQLPAASELE 
651 WLVPEHDAPQ KLLPIPDSLP ISPDDGQHAD IYKLRIRVRG NLEWAPPRPQ 
701 IIFNVHPAPT RKIAVAKQNY RCAGCGIRTD PDYIKRLRYC EYLGKYFCQC 
751 CHENAQMAIP SRVLRKWDFS KYYVSNFSKD LLIKIWNDPL FNVQDINSAL 
801 YRKVKLLNQV RLLRVQLCHM KNMFKTCRLA KELLDSFDTV PGHLTEDLHL 
851 YSLNDLTATR KGELGPRLAE LTRAGATHVE RCMLCQAKGF ICEFCQNEDD 
901 I IFPFELHKC RTCEECKACY HKACFKSGSC PRCERLQARR EALARQSLES 
951 YLSDYEEEPA EALALEAAVL EAT 

HITS AT: 137-145 

MF Unspecified 

CI MAN 

SR • GenBank 

LC STN Files: CA, CAPLUS, TOXCENTER, USPATFULL 
DT.CA CAplus document type: Patent 

RL.P Roles from patents: BIOL (Biological study); PRP (Properties) 
6 REFERENCES IN FILE CA (1907 TO DATE) 
6 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 72 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 479898-21-0 REGISTRY 

CN Similar to KIAA0226 gene product (human clone MGC: 40578 IMAGE : 5217239) 

(9CI) (CA INDEX NAME) 
OTHER NAMES: 
CN GenBank AAH33615 

CN GenBank AAH33615 (Translated from: GenBank BC033615) 
FS PROTEIN SEQUENCE 
SQL 375 

SEQ 1 MQSILYHGLI RDQACRRQTD YWQFVKDIRW LSPHSALHVE KFISVHENDQ 

51 SSADGASERA VAELWLQHSL QYHCLSAQLR PLLGDRQYIR KFYTDAAFLL 



Page 45 



0 



02/04/2008 



101 SDAHVTAMLQ CLEAVEQNNP RLLAQI DASM FARKHESPLL VTKSQSLTAL 
151 PSSTYTPPNS YAQHS YFGSF SSLHQSVPNN GSERRSTSFP LSGPPRKPQE 
201 SRGHVSPAED QTIQAPPVSV SALARDSPLT PNEMSSSTLT SPIEASWVSS 
251 QNDSPGDASE GPEYLAIGNL DPRGRTASCQ SHSSNAESSS SNLFSSSSSQ 
301 KPDSAASSLG DQEGGGESQL SSVLRRSSFS EGQTLTVTSG AKKSHIRSHS 
351 DTSIASRGAP GNEEHRLLVS KMTLN 

HITS AT: 91-99 

MF Unspecified 

CI MAN 

SR GenBank 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents : BIOL (Biological study); PRP (Properties) 
1 REFERENCES' IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 73 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 459773-35-4 REGISTRY 

CN GenBank- CAC98232 (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN GenBank CAC98232 (Translated from: GenBank AL591973) 
FS PROTEIN SEQUENCE 
SQL 372 

SEQ 1 MKSRKKGIIL VLSVILIFSI GLLVNNLMTN NKDTAKPKKK TVAAVKKKKE 

51 TPPKPKEPFN IDFTGDIMFD WDLRPVLAEK GMDYPFNNVR EELKSSDYTF 
101 VDLETAITTR TKKVPYQEFW IKSDPSSLTA LKNAGVDMVN ISNNHILDYY 
151 EDGLLDTTAA LRANNLAYVG AGKNEDEAYQ LKVADIKGNK VGFMSFCHFF 

201 PNTGWIADED TPGVTNGYDL NLVEEKIKEE RAKNKDIDYM VVYFHWGVEK 



251 TNTPVDYQTQ YVKKLVDDNL VDAIVASHPH WLQGFEVYKD VPIAYSLGNF 
301 LFPDYVSGHS AETGIYKLNF DQGKVTAHFD PGIISGNQIN MLEGSSKTAQ 
351 LNYLQSISPN ATINSNGDIS AK 
HITS AT: 198-206 



**RELATED SEQUENCES AVAILABLE WITH SEQLINK** 
MF Unspecified 
CI MAN 
SR CA 



L2 ANSWER 7 4 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 455367-88-1 REGISTRY 

CN Acetolactate -synthase (Thermosynechococcus elongatus strain BP-1 gene 

ilvB) (9CI) (CA INDEX NAME) 
OTHER NAMES: 
CN GenBank BAC09376 

CN GenBank BAC09376 (Translated from: GenBank AP005375) 
FS PROTEIN SEQUENCE 
SQL 552 

SEQ 1 MPAIAHPMNT AELLVKCLEN EGVEYIFGLP GEENLDVLHA LHRSRIQFIT 

51 TRHEQGAAFM ADVYGRLTGK AGVCLSTLGP GATNLMTGVA DANLDGAPLV- 
101 AITGQVGTDR MHIESHQYLD LVAMFSPVTK WNAQIVRPSI TPEIVRKAFK 
151 IAQNEKPGAV HIDVPENIAA MEAEGAPLKP SSPEKTYASF QSILKAAELI 
201 NAAENPLILV GNGAIRAHAA AALTHFAEKL NIPVANTFMG KGVIPYQHPL 
251 ALWTVGLQQR DYISCGFDHA DLVIAVGYDL IEYSPKSWNP EGRLPI IHIA 
301 ATPAEIDSSY .IPVVEVVGDI SDSLYEILKR ADRQDKPTPY AVQLRQEIVA 
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351 DYYQYAQDES FPVKPQKLIY DLRQVMGPED IVISDVGAHK MWIARHYHCD 
4 01 RPNTCLISNG FAAMGIAVPG AMAAKLVYPQ RHVVAVTGDG GFMMNFQELE 
451 TALRMGTNFT TIIFNDGGYG LIEWKQQRYF GESAYVHFSN PDFVKLAESM 



501 GLKGYRIEET TDFIPTLKTA LAQNVPTLID VPVDYSENLR FNQRLGELSC 

551 TP 
HITS AT: 478-486 
MF Unspecified 
CI MAN 
SR CA 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 75 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 443166-78-7 REGISTRY 

CN Protein (Candida albicans clone Sa386383_7668 gene fragment) (9CI) (CA 

INDEX NAME) 
OTHER NAMES: 

CN 668: PN : WO02053728 SEQID: 7668 claimed protein 
FS PROTEIN SEQUENCE 
SQL 429 

PATENT ANNOTATIONS (PNTE) : 
Sequence I Patent 
Source (Reference 

Not Given|WO2002053728 
I claimed SEQID 
17668 



GVKLLLLGAG 
VLIIQARKLK 
DFVVKYSEEN 
EQFTRLSIAE 
TNYLSTDLDI 
FEDITAVLFV 
ILFLNKIDIF 



351 ENKIKKNPLK NYFPDYDGKP DDTNEAIKFF ETNFLKINQT NKPIYVHRTC 
401 ATDSKSMKFV LSAVTDMIVQ QNLKKSGII 
HITS AT: 333-341 

RELATED SEQUENCES AVAILABLE WITH SEQLINK** 
MF Unspecified 
CI MAN 
SR CA 

LC STN Files: CA, CAPLUS, TOXCENTER 
DT.CA CAplus document type: Patent 

RL.P Roles from patents: ANST (Analytical study); BIOL (Biological study); 
PRP (Properties); USES (Uses) 

1 REFERENCES IN FILE CA (1907 TO DATE) 

1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 76 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 405119-76-8 REGISTRY 



SEQ 1 MGCGASVPVD DDEIDLFLQD KRINDAIEQS LQLRQQNSKK 

51 ESGKSTVLKQ LKLLHKGGFT QQERRQYSHV IWCDVIQSMK 

101 IKLDCDQPNN SLIPYKQIIL RSDPLKQIDA DVAGGTDFLN 

151 KNKRRLKSTG TTDIWGKDDD SNINSDAINQ ALESSLNKDS 

201 AIHKLWKLDS GIKKCFDRSN EFQLEGSADY YFDNVFNFAD 

251 LKGRIKTTGI TETDFLIKSF QFKVLDAGGQ RSERKKWIHC 

301 LAISEYDQNL FEDERVNRMH ESIVLFDSLC NSKWFANTPF 
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CN Cell division protein (Corynebacterium glutamicum ATCC13032 gene ftsX) 

(9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 2: PN: WO0222670 SEQID: 2 claimed protein 
FS PROTEIN SEQUENCE 
SQL 300 

PATENT ANNOTATIONS (PNTE) : 
Sequence | Patent 
Source (Reference 



Not Given) WO2002022670 
I claimed SEQID 
12 



SEQ 1 MAFGYVLREA VRGMGRNVTM TIALIITTSI SLALLATGFL VTNMTDRTKD 

51 IYLDRVEVMI QLDEDTSAND PECTAESCTE VRDVLEGLDG IDSITYRSRE 

101 ASYERFVEVF KDTDPVLVAE TSPDALPAAF HVRLEDPLAV EILDPVRDLP 

151 QVSNVIDQVD DLRGATENLD SIRNATFLIA AVQVLAS I FL IANMVQIAAF 

201 NRREETEIMR IVGASRFYTQ GPFVFEAILS TLIGAVFAVG ALFLGKELVI 



251 DKALRGLYDS QLIAPVTTTD IWLVAPIISG IGVVIAGIIA QLTLRFYVRK 
HITS AT: 216-224 

* * RELATED' SEQUENCES AVAILABLE WITH SEQLINK** 
MF Unspecified 
CI MAN 
SR CA 

LC STN Files: CA, CAPLUS, . TOXCENTER, USPATFULL 
DT.CA CAplus document type: Patent 

RL.P Roles from patents: BIOL (Biological study); PREP (Preparation); PRP 
(Properties); USES (Uses) 

1 REFERENCES IN FILE CA (1907 TO DATE) 

1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 77 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 367564-96-3 REGISTRY 

CN Protein (Listeria monocytogenes strain EGD-e gene LM-587.1) (9CI) (CA 

INDEX NAME) 
OTHER NAMES: 

CN 2526: PN: WO0177335 SEQID: 2527 claimed protein 

CN Protein (Listeria monocytogenes, strain EGD-e gene lmo0017) 

FS PROTEIN SEQUENCE 

SQL 372 

PATENT ANNOTATIONS (PNTE): 
Sequence I Patent 
Source I Reference 



Not Given|WO2001077335 
I claimed SEQID 
12527 



SEQ 1 MKSRKKGIIL VLSVILIFSI GLLVNNLMTN NKDTAKPKKK TVAAVKKKKE 

51 TPPKPKEPFN IDFTGDIMFD WDLRPVLAEK GMDYPFNNVR EELKSSDYTF 

101 VDLETAITTR TKKVPYQEFW IKSDPSSLTA LKNAGVDMVN ISNNHILDYY 

151 EDGLLDTTAA LRANNLAYVG AGKNEDEAYQ LKVADIKGNK VGFMSFCHFF 
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201 PNTGWIADED TPGVTNGYDL NLVEEKIKEE RAKNKDIDYM VVYFHWGVEK 



251 TNTPVDYQTQ YVKKLVDDNL VDAIVASHPH WLQGFEVYKD VPIAYSLGNF 
301 LFPDYVSGHS AETGI YKLNF DQGKVTAHFD PGIISGNQIN MLEGSSKTAQ 
351 LNYLQSISPN ATINSNGDIS AK 
HITS AT: 198-206 

**RELATED SEQUENCES AVAILABLE WITH SEQLINK** 
MF Unspecified 
CI MAN 
SR CA 

LC STN Files: CA, CAPLUS, USPATFULL 
DT.CA CAplus document type: Journal; Patent 

RL.P Roles from patents: ANST (Analytical study); BIOL (Biological study); 

OCCU (Occurrence); PRP (Properties); USES (Uses) 
RL.NP Roles from non-patents: BIOL (Biological study); PRP (Properties) 

2 REFERENCES IN FILE CA (1907 TO DATE) 

2 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 78 OF 82 REGISTRY COPYRIGHT .2008 ACS on STN 
RN * 364091-06-5 REGISTRY 

CN Protein (Corynebacterium glutamicum strain ATCC13032 clone 

EP1108790-SEQID-4396) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 895: PN: EP1108790 SEQID:4396 claimed protein 
FS PROTEIN SEQUENCE 
SQL 300 

PATENT ANNOTATIONS (PNTE): 
Sequence | Patent 
Source | Reference 



Not Given|EP1108790 

I claimed SEQID 
14396 



SEQ 1 MAFGYVLREA VRGMGRNVTM TIALIITTSI SLALLATGFL VTNMTDRTKD 

51 IYLDRVEVMI QLDEDTSAND PECTAESCTE VRDVLEGLDG IDSITYRSRE 

101 ASYERFVEVF KDTDPVLVAE TSPDALPAAF HVRLEDPLAV EILDPVRDLP 

151 QVSNVIDQVD DLRGATENLD SIRNATFLIA AVQVLAS I FL IANMVQIAAF 

201 NRREETEIMR IVGASRFYTQ GPFVFEAILS TLIGAVFAVG ALFLGKELVI 



251 DKALRGLYDS QLIAPVTTTD IWLVAPIISG IGVVIAGIIA QLTLRFYVRK 
HITS AT: 216-224 

* *RELATED SEQUENCES AVAILABLE WITH SEQLINK** 
MF Unspecified 
CI MAN 
SR CA 

LC STN Files: CA, CAPLUS, TOXCENTER, USPATFULL 
DT.CA CAplus document type: Patent 

RL.P Roles from patents: ANST (Analytical study); BIOL (Biological study); 
OCCU (Occurrence); PRP (Properties); USES (Uses) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 79 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
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RN 349152-71-2 REGISTRY 

CN G protein (guanine nucleotide-binding protein) (Kluyveromyces lactis 

strain WM37 gene gpal subunit a) (9CI) (CA INDEX NAME) 
OTHER NAMES: 
CN GenBank AAD33674 
CN GenBank AAD33674 
FS PROTEIN SEQUENCE 
SQL 447 



(Translated from: GenBank AF135552) 



SEQ 1 MGCVASTGNY ENEDDPFIQN KRANDLIEQN LQQERNKNKN EVKLLLLGAG 

51 ESGKSTVLKQ MKLLHQGGFT HRERMQYGQV IWADAIESMR TLILQAGKLG 

101 IELDSDLKNA HSGQLVNTEL HQCKEKIFRA NTLDQIDARM AGGSEFLNEY 

151 VLKYNGIGSK KKRQTTLGFK ESNGADPEEE DETDAFLSEK LAGTSYTGSS 

201 ETSELKRIDQ STNEEIAYAI KKLWTQDKGI RQCFNRSSEF QLEGSASYYF 

251 DNIEKFARVD YVCDDMDILK VRIKTTGITE NSFKIGPSTF KVYDAGGQRS 

301 ERRKWIHCFE GITAVVFVIA ISEYDQMLFE DERVNRMHES IVLLDTLLNS 

351 RWFANTPFIL FLNKVDIFQE KVKRSPIRTW FPNYPGKLGD SETGLKYFES 



4 01 LFLSLNRSNK PIYVHRTCAT DTQSMRFVLG AVTDLVIQQN LKKSGIL 
HITS AT: 351-359 
MF Unspecified 
CI MAN 
SR CA 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study)/ PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 80 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 348664-71-1 REGISTRY 

CN Molybdopterin oxidoreductase, molybdopterin binding subunit (Sulfolobus 

solfataricus gene SSO1580) (9CI) (CA INDEX NAME) 
OTHER NAMES: 
CN GenBank AAK41791 

CN GenBank AAK41791 (Translated from: GenBank AE006771)' 
FS PROTEIN SEQUENCE 
SQL 1050 



SEQ 1 MNYDSNIMVE NKGLRLNRRD FLKASAAAGL VMGLGYFASK YNFNIFVTRD 

51 AVNDEELQPG WQYSYVPSIC AFCSSTCDIL VETEYINGYM RALEIDGNPL 

101 SPLNEGKVCP RGRAGIFRTY NVDRLKTPLI RTGPKGTWSF REATWEETIN 

151 YIMQNIKQLN PQPYEFLLIG GGI PCANYKP YFIPFTLGTQ I PNINGTPMQ 

201 TCLFSDQVPI GFVIGGFDLH ATDMMDDMTY SSLIVAWGTS GIPAGI FVNR 

251 AIRYAKGIEN GAYVIAIDPR MSEAASKANL WIPAKPGSDL YIAMAI INYL 

301 IQNNYYDDEF VRYYTNAPFL AYKENDVVKL LEEDYDDGTV KAYYVYDEIS 



351 GQIIEVPPFT NTNKYDVNGN FIKPALNAPQ GLTYNGKQVQ TVFQFLAQKV 

4 01 SNYTLEYAAQ VADVPLSQLQ ELAFRIATMK PMTIITGLKG FFNDQAVQFR 

4 51 KAYATIMALT GNIDIRGGWV YSAVYREGIK KVVNAYNNMV SSGKTKPGIL 

501 LQRPEVLEQL PVNELPGAFF AMFPIIYVYN NPSFWITGVP ALSYAYNQVL 

551 KQQGKKPAGA YTLFEESGSY AAIKGQVTWN GQPYKPKVAM SYGGSPFNFK 

601 WDEYKQVLET TFYIMIDILP TEASLYADVI LPDVTYLERD EFFWDDGPAM 

651 DRAIRGRWQT IPVLWPNTAN GLDLFIMFAY MLNPQAGDAY IQWMARFAGV 

701 PLDILKSVIQ QEMPNYQQYL MKNNGYPQWG SFTAKAWREA QLSVLSEELN 

751 MPKEQILQTL RNNGVIVIKT VDDYFANHER IPWDLPAATP TGRIEI YSTI 

801 LYYYVIQNYG YDPVWDPIIA EI PPNWNGGY AVEDGVYLSP PTPYNYPTFK 

851 PTPPELFFIE YKIPQFAYTS SADNPLLMAI TSNSYHKDIL MRAWINPTTA 

901 AQLGINEGDW IAIERWKLPN PDGSIPKLIV RAHLTQWIRP DTIGVPEPFG 
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951 QRNPALTTAT KAVNEFGNQP VSVMWAPGRN PLGGYHMNEQ FTVRVRKATS 
1001 DEIQAATQLA SVQTPDTLPS QQAKVTTPNN SASQNDWQQY VSYGNVSTLG 
HITS AT: 312-320 
MF Unspecified 
CI MAN 
SR CA 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: BIOL (Biological study)/ PRP (Properties) 
1 REFERENCES IN FILE CA (1907 TO DATE) 
1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 81 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 209600-11-3 REGISTRY 

CN Carnitine transporter (Treponema pallidum gene TP0106) (9CI) (CA INDEX 

NAME) 
OTHER NAMES: 
CN GenBank AAC26554 

CN GenBank AAC26554 (Translated from: GenBank AE001195) 
FS PROTEIN SEQUENCE 
SQL 510 

SEQ 1 MQKEKCDFSV SLIPLGIVIS CALLFISFPD ISHRVIGTLL NILVNKLGFF 

51 YILTGLFFLG TTLTIAFSRY GAVYLGTTRT ARYSNFTWGS MIFTSTMAAD 
101 ILYWSLIEWA HYFTQAPFIA EHSPPTERQE WAAAYPLFHW GIIPWSFYVL. 



151 LAVAFGYMLH VKKRHTHKIS EACRPLLGAY VDGIIGEAID ICSVVGLLLG 
201 VATTFSLATP LLSLMVSLLF GISNTQLLAL ALLCVIALVY TTAVLLGTQG 
251 ISKLSRAAVY CFSTVLVFFL CAGPTVYLIE TGITAIGKML QNFFLMATWM 
301 DPSRISLQET DGTLGFPQRW TIFYWAYWIT WSVATPFFIG AISEGRTIRN 
351 TIVGGLCWGI AGTYGSFIVL GNYGLYLQTH HLLPAAYFVR AGNTPAEVII 
401 AIIQTLPCAY IVMALLAATM IAFYASTFDA LTLI IASYSQ KSVAPGEEPR 
451 QIMKSFWAVA CILLPASLIF SQSTLMHLKS LAIIAAFPLA LIMLCVVASF 
501 FKELRAEVTS 

HITS AT: 111-119 

MF Unspecified 

CI MAN 

SR CA 

LC STN Files: CA, CAPLUS 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: PRP (Properties) 

1 REFERENCES IN FILE CA (1907 TO DATE) 

1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

L2 ANSWER 82 OF 82 REGISTRY COPYRIGHT 2008 ACS on STN 
RN 147387-25-5 REGISTRY 

CN Protein G (Candida albicans clone YCp50-CAGl gene CAG1 guanine 

nucleotide-binding a-subunit reduced) (9CI) (CA INDEX NAME) 
FS PROTEIN SEQUENCE 
SQL 429 

SEQ 1 MGCGASVPVD DDEIDPFLQD KRINDAIEQS LQLRQQNSKK GVKLLLLGAG 

51 ESGKSTVLKQ LKLLHKGGFT QQERRQYSHV IWCDVIQSMK VLI IQARKLK 
101 IKLDCDQPNN SLIPYKQIIL RSDPLKQIDA SVAGGTDFLN DFVVKYSEEN 
151 KNKRRLKSTG TTDIWGKDDD SNINSDAINQ ALELSLNKDS EQFTRLSIAE 
201 AIHKLWKLDS GIKKCFDRSN EFQLEGSADY YFDNVVNFAD TNYLSTDLDI 
251 LKGRIKTTGI TETDFLIKSF QFKVLDAGGQ RSVRKKWIHC FEDITAVLFV 
301 LAISEYDQNL FEDERVNRMH ESIVLFDSLC NSKWFANTPF ILFLNKIDIF 
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351 ENKIKKNPLK NYFPDYDGKP DDTNEAIKFF ETNFLKINQT NKPI YVHRTC 

4 01 ATDSKSMKFV LSAVTDMIVQ QNLKKSGIM 
HITS AT: . 333-341 
MF Unspecified 
CI MAN 
SR CA 

LC STN Files: . CA, CAPLUS, MEDLINE, TOXCENTER 

DT.CA CAplus document type: Journal 

RL.NP Roles from non-patents: PRP (Properties) 

1 REFERENCES IN FILE CA (1907 TO DATE) 

1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 
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